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(54) Title: GRAIN PROCESSING METHOD AND TRANSGENIC PLANTS USEFUL THEREIN 

(57) Abstract: The invention provides novel methods of processing grain, such as com and soybeans, utilizing thioredoxin and/or 
thioredoxin reductase to enhance exiraclability and recovery of starch and protein. The invention further provides novel transgenic 
plants expressing thermostable thioredoxin and/or tliioredoxin reductase. 
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Therein 



This invention relates to improved methods of <rr«™ 
Pianu in such processes " " 



25 



Thioredoxin (TRX) and thioredoxin reductase fTm . 

or »™i digesuon (boU, .fene. .o .^„e„.„ as ' ^""^ 

processes A s Jnd J " ^^i- ^-™S 

P ssmg. A second advantage to supplying ftio^o^ « 

rains is that physical dkmnrinn «f j. """"^ ™»'«lo™ reductase via the 

contact With the s.o!l °°' » 

process„,st:r 

Three „„des of thioredoxin utiB^ion in grain processing are p„vided: 
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1. Expression and action during seed development to alter the composition and quality of 
harvested grain; 

2. Expression (but no activity) during seed development to alter the quality of the products 
upon processing; 

5 3. Production of thioredoxin and/or thioredoxin reductase in grain that is used to alter the 
quality of other grain products by addition during processing. 

The invention described herein is applicable to all grain crops, in particular com, soybean, 
wheat, and barley, most particularly com and soybean, especially com. Expression of 
transgenic thioredoxin and/or thioredoxin reductase in grain is a means of altering the quality 
10 of the material (seeds) going into grain processing, altering the quality of the material derived 
from grain processing, maximizing yields of specific seed components during processing 
(increasing efficiency), changing processing methods, and creating new uses for seed-derived 
fractions or components from milling streams. 

The invention thus provides a plant which expresses a thioredoxin and/or thioredoxin 
15 reductase, e.g. a thioredoxin and/or thioredoxin reductase not naturally expressed in plants, for 
example a plant coniprising a heterologous DNA sequence coding for a thioredoxin stably 
integrated into its nuclear or plastid DNA, preferably under control of an inducible promoter, 
e.g., a chemically-inducible promoter, for example either operatively linked to the inducible 
promoter or under control of transactivator-regulated promoter wherein the corresponding 
20 transactivator is under control of the inducible promoter or is expressed in a second plant such 
that the promoter is activated by hybridization with the second plant; wherein the thioredoxin 
or thioredoxin reductase is preferably thermostable or a eukaryotic reductase; such plant also 
including seed therefor, which seed is optionaDy treated (e.g., primed or coated) and/or 
packaged, e.g. placed in a bag with instmctions for use, and seed harvested therefrom, e.g., for 
25 use in a milling process as described above. 

The transgenic plant of the invention niay optionally further comprise genes for enhanced 
production of thioredoxin reductase and/or NADPH. 

The invention further provides a method for producing a thioredoxin comprising cultivating a 
thioredoxin-expressing plant as described above; a method for producing starch and/or protein 
30 comprising extracting starch or protein from seed harvested from a plant as described above; 
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and a method for wet milling comprising steeping seed from a thioredoxin-expressing plant as 
described above and extracting starch and/or protein therefrom. 

The invention further provides a plant expressible expression cassette conning a coding 
region for a thioredoxin or thioredoxin i^uctase, preferably a thioredoxin derived torn a 
5 thermophilic organism, e.g.. from an archea, for example ^mMethanococcusjannascMi or 
Archaeglobusfulgidus, e.g., as described below, wherein the coding region is preferably 
optimized to contain plant preferred codons. said coding region being operatively linked to 
promoter and terminator sequences which function in a plant, wherein the promotor is 
preferably a seed specific promoter or an inducible promoter, e.g.. a chemically inducible or 

10 transactivator-regulated promoter; for example a plastid or nuclear expressible expression 
cassette comprising a promoter, e.g., a transactivator-mediated promoter regulated by a 
nuclear transactivator (e.g.. the T7 promoter when the transactivator is T7 RNA polymerase 
the expression of which is optionally under control of an inducible promoter). 
The invention further provides a vector comprising such a plant expressible expr^ion 

15 cassette. 

The invention further provides a plant transformed with such a vector, or a transgenic plant 
which comprises in its genome, e.g.. its nuclear or plastid genome, such a plant expressible 
expression cassette. 

The invention also comprises a method of producing grain comprising high levels of 
20 thioredoxin or thioredoxin reductase comprising poDinating a first plant comprising a 

heterologous expression cassette comprising a transactivator-mediated promoter regulated and 
operatively linked to a DNA sequence coding for a thioredoxin or thioredoxin reductase the 
first plant preferably being emasculated or male sterile, with poDen from a second plant 
comprising a heterologous expression cassette comprising a promoter operative^ linked to a 
5 DNA sequence coding for a transactivator capable of regulating said transactivator-mediated 
promoter, and recovering grain from the plant thus pollinated. 
The invention also provides a nucieic acid molecule con^rising a nucleotide sequence 
encoding eukao'otiolr^uto;,.^ NADPIT dependent thioredoxin reductase (NTR) wherein 
the nucleotide sequence is optimized for expression in a monocotyledonous plant, preferably 
) optmuzed for expression in maize. The nucleotide sequence is preferably the nucleotide 
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sequence of SEQ ID NO:24 and preferably encojJes the amino acid sequence of SEQ ID 
NO:25. 

The invention also provides an isolated nucleic molecule comprising a nucleotide sequence 
encoding eucaryotic rice NADPH"^ dependent thioredoxin reductase (NTR). The nucleotide 
5 sequence preferably encodes the amino acid sequence of SEQ ID NO:27. The nucleotide 
sequence is preferably the nucleotide sequence of SEQ ED NO:25. 

BRIEF DESCRIPTION OF THE SEQUENCE USTING 

10 SEQ ID NO: 1 - Protein sequence of thioredoxin from Methanococcus jannaschii 

(gill591029). 

SEQIDNO:2 - Protein sequence of thioredoxin from Arc/iaeog/o£»i«/u/giJMj 
(gil2649903)(trx-l). 

SEQ ID NO:3 - Protein sequence of thioredoxin from Archaeoglobusfulgidus (gil2649838) 
15 (trx-2). 

SEQIDNO:4 - Protein sequence of thioredoxin from ArcAaeog totMjyi</fidMj (gil2649295) 
(trx-3). 

SEQ ID NO:5 - Protein sequence of thioredoxin from Archaeoglobusfulgidus (gil2648389) 
(trx-4). • 
20 SEQ ID N0:6 - Protein sequence of thioredoxin reductase (trxB) from Methanococcus 

jannaschii (gil 15921 67). 
SEQIDNO:7 - Protein sequence of thioredoxin reductase from Arc/iaeog/oi>i«/u/gufMj 

(gil2649006) (trxB). 
SEQ ID NO:8 - Primer NMD109. 
25 SEQIDN0:9 - Primer NMDllO. 
SEQ ID NO: 10 - Primer NMD102. 
SEQ ID N0:11 - Primer NMD103. 
SEQ ID N0:12 - Primer NMD124A. 
SEQ ID NO:13 - Primer NMD125A. 
30 SEQIDNO:14-PrimerNMD126. 
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SEQ ID NO: 15 - Primer KnviD127. 
SEQ ID NO:16 - Primer NMD128. 
SEQ ID NO: 17 - Primer NMD 129. 
SEQ ID NO:] 8 - Primer STRFIA. 
SEQ ID NO: 1 9 - Primer STRFIB. 
SEQ ID NO:20 - Primer STRF2A. 
SEQ BD NO:21 - Primer STRF2B. 
SEQ ID NO:22 - Primer STR3 A. 
SEQ ID NO:23 - Primer STR3B. 

SEQ ID NO:24 - Maize opting Arabidopsis NADPH dependent thioredoxin reductase 

coding sequence. 
SEQ ID NO:25 - Amino acid sequence encoded by SEQ ID NO:24. 
SEQ ID NO:26 - Rice NADPH dependent thioredoxin reductase (NTR) coding sequence. 
SEQ ID NO:27 - Amino acid sequence encoded by SEQ ID NO:26. 
15 SEQ ID NO:28- Primer P9. 
SEQ ID NO:29 - Primer PIO. 
SEQ ID NO:30 - Primer P4. 
SEQ ID NO:3l -Primer PI. 
SEQ ID NO:32 - Primer P2. 
20 SEQ ID NO:33- Primer P5. 
SEQ ID NO:34 - Primer P12. 
SEQ ID NO:35- Primer PI 1. 
SEQ ID NO:36 - Primer P27. 
SEQ ID NO:37- Primer P28. 
25 SEQ ID NO:38 - Primer P29. 

SEQ ID NO:39 - Primer P26. 

SEQ ID NO:40 - Primer P3 1 . 

SEQ ID NO:41 - Primer Thiorodoxubi 1603. 

SEQ ID NO:42 - Primer Thiorodox 2364. 

30 
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"Associated with / operatively linked" refer to two nucleic acid sequences that are related 
physically or functionally. For example, a promoter or regulatory DNA sequence is said to be 
"associated with" a DNA sequence that codes for an RNA or a protein if the two sequences 
are operatively linked, or situated such that the regulator DNA sequence will affect the 
5 expression level of the coding or structural DNA sequence. 

A "chimeric gene" is a recombinant nucleic acid sequence in which a promoter or regulatory 
nucleic acid sequence is operativety linked to, or associated with, a nucleic acid sequence that 
codes for an mRNA or which is expressed as a protein, such that the regulator nucleic acid 
sequence is able to regulate transcription or expression of the associated nucleic acid sequence. 
10 The regulator nucleic acid sequence of the chimeric gene is not normally operatively linked to 
the associated nucleic acid sequence as found in nature. 

A "coding sequence" is a nucleic acid sequence that is transcribed into RNA such as mRNA, 
rRNA, tRNA, snRNA, sense RNA of antisense RNA. Preferably the RNA is then translated in 
an organism to produce a protein. 
15 Con^lementary: "complementary" refers to two nucleotide sequences that comprise 

antiparaDel nucleotide sequences capable of pairing with one another upon formation of 
hydrogen bonds between the coniplementary base residues in the antiparallel nucleotide 
sequences. 

DNA Shufifling: DNA shuffling is a method to rapidly, easily and efTiciently introduce 
20 mutations or rearrangements, preferably randomly, in a DNA molecule or to generate 

exchanges of DNA sequences between two or more DNA molecules, preferably randomly. The 
DNA molecule resulting from DNA shuffling is a shuffled DNA molecule that is a non- 
naturally occurring DNA molecule derived from at least one template DNA molecule. The 
shuffled DNA encodes an enzyme modified with respect to the enzyme encoded by the 
25 template DNA, and preferably has an altered biological activity with respect to the enzyme 
encoded by the template DNA. 

Enzyme/Protein Activity: means herein the ability of an enzyme (or protein) to catalyze the 
conversion of a substrate into a product. A substrate for the enzyme comprises the natural 
substrate of the enzyme but also comprises analogues of the natural substrate, which can also 
30 be converted, by the enzyme into a product or into an analogue of a product. The activity of 
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the enzyme is measured for example by determining the amount of product in the reaction after 
a certain period of time, or by determining the amount of substrate remaining in the reaction 
mixture after a certain period of time. The activity of the enzyme is also measured by 
determining the amount of an unused co-fector of the reaction remaining in the reaction 
mixture after a certain period of time or by determining the amount of used co-factor in the 
reaction mixture after a certain period of time. The actwity of the enzyme is also measured by 
determining the amount of a donor office energy or energy-rich molecule (e.g. ATP, 
phosphoenolpyruvate. acetyl phosphate or phosphocreatine) remaining in the reactioii mixture 
after a certain period of time or by determining the amount of a used donor of free energy or 
energy-rich molecule (e.g. ADP, pyruvate, acetate or creatine) in the reaction mixture after a 
certain period of tinae. 

Expression Cassette: "Expression cassette" as used herein means a DNA sequence capable of 
directing expression of a particular nucleotide sequence in an appropriate host cell, comprising 
a promoter operatively linked to the nucleotide sequence of interest which is operatively linked 
to termination signals. It also typically comprises sequences required for proper translation of 
the nucleotide sequence. The coding region usually codes for a protein of interest but may also 
code for a functional RNA of interest, for example antisense RNA or a nontranslated RNA in 
the sense or antisense direction. The expression cassette comprising the nucleotide sequence of 
interest may be chimeric, meaning tiiat at least one of its components is heterologous with 
respect to at least one of its other components. The expression cassette may also be one which 
is naturaUy occurring but has been obtained in a recombinant form useful for heterologous 
expression. Typically, however, the expression cassette is heterologous with respect to the 
host. i.e., the particular DNA sequence of the expression cassette does not occur naturally in 
the host ceU and must have been introduced into the host cell or an ancestor of the host cell by 
a transformation event. The expression of the nucleotide sequence in the expression cassette 
may be under the control of a constitutive promoter or of an inducible promoter which initiates 
transcnption only when the host cell is exposed to some particular external stimulus. In the 
case of a multiceUular organism, such as an insect, the promoter can also be specific to a 
particular tissue or organ or stage of development. 
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Gene: the term "gene" is used broadly to refer to any segment of DNA associated with a 
biological function. Thus, genes include coding sequences and/or the regulatory sequences 
required for their expression. Genes also include nonexpressed DNA segments that, for 
example, form recognition sequences for other proteins. Genes can be obtained from a variety 

5 of sources, including cloning from a source of interest or synthesizing from known or predicted 
sequence information, and may include sequences designed to have desired parameters. 
Heterologous DNA Sequence: The terms 'lietarologoas DNA sequrace", "exogenous DNA segment" or 
"heterologous nucleic acid," as used herein, each refer to a sequmce that originates from a source 
foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a 

10 heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has 
been modified through, for example, the use of DNA shuffling. The terms also includes non-naturally 
occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA 
segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the 
host cell nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are 

15 expressed to yield exogenous polyp^tides. 

Homologous DNA Sequence: a DNA sequence naturally associated with a host cell. 
"Homoplastidic" refers to a plant, plant tissue or plant cell wherein all of the plastids are 
genetically identical. This is the normal state in a plant when the plastids have not been 
transfomnied, mutated, or otherwise genetically altered. In dilBferent tissues or stages of 

20 development, the plastids may take different forms, e.g., chloroplasts, proplastids, etioplasts, 
amyloplasts, chromoplasts, and so forth. 

Isolated: in the context of the present invention, an isolated DNA molecule or an isolated 
enzyme is a DNA molecule or protein which, by the hand of man, exists apart from its native 
environment and is therefore not a product of nature. An isolated DNA molecule or protein 
25 may exist in a purified form or may exist in a non-native environment such as, for example, in a 
transgenic host cell. 

Mature Protein: protein that is normally targeted to a cellular organelle and from which the 
transit peptide has been removed. 

Minimal Promoter: promoter elements, particularly a TATA element, that are inactive or that 
30 have greatly reduced promoter activity in the absence of upstream activation. In the presence 
of a suitable transcription factor, the minimal promoter functions to permit transcription. 
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Modified Enzyme Activity: enzyme activity different from that which naturally occurs in an 
insect (i.e. enzyme activity that occurs naturaUy in the absence of direct or indirect 
manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally 
occurring enzyme activity. 

Native: refers to a gene that is present in the genome of an untransformed insect cell. 
Naturally occurring: the term "naturally occurring" is used to describe an object that can be 
found in nature as distinct from being artificially produced by man. For example, a protein or 
nucleotide sequence present in an organism (including a virus), which can be isolated from a 
source in nature and which has not been intenUonally modified by man in the laboratory, is 
naturally occurring. 

Nucleic acid: the term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and 
polymers thereof in either single- or double-stranded form. Unless specifically limited, the term 
encompasses nucleic acids containing known analogues of natural nucleotides which have 
similar binding properties as the reference nucleic acid and ^ metabolized in a manner similar 
to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid 
sequence also implicitly encompasses conservatively modified variants thereof (e.g. degenerate 
codon substitutions) and con^lementary sequences and as wefl as the sequence expUcitly 
indicated. Specifically, degenerate codon substitutions may be achieved by generating 
sequences in which the third position of one or more selected (or all) codons is substituted 
with mixed-base and/or deoxyinosine residues (Batzer et ai. Nucleic Acid Res. 19: 5081 
(1991); Ohtsuka et ai. J. Biol. Chem. 260: 2605-2608 (1985); Rossolini et al, Mol Cell 
Probes 8: 91-98 (1994)). The terms "nucleic acid" or "nucleic acid sequence" may also be used 
interchangeably with gene, cDNA. and mRNA encoded by a gene. 
A "plant" is any plant at any stage of development, particularly a seed plant. 
A "plant ceU" is a structural and physiological unit of a plant, comprising a protoplast and a 
cell wall. The plant cefl may be in form of an isolated single cell or a cultured cell, or as a part 
of higher organized unit such as, for example, plant tissue, a plant organ, or a whole plant. 
•Tlant cell culture" means cultures of plant units such as, for example, protoplasts, cell culture 
cells, cells in plant tissues, pollen, pollen tubes, ovules, embryo sacs, zygotes and embryos at 
various stages of development. 
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*Tlaiit material" refers to leaves, stems, roots, flowers or flower parts, fruits, pollen, egg cells, 
zygotes, seeds, cuttings, cell or tissue cultures, or any other part or product of a plant. 
A *'plant organ" is a distinct and visibly structured and differentiated part of a plant such as a 
root, stem, leaf, flower bud, or embryo. 

5 *Tlant tissue" as used herein means a group of plant cells organized into a structural and 

functional unit. Any tissue of a plant in planta or in culture is included. This term includes, but 
is not limited to, whole plants, plant organs, plant seeds, tissue culture and any groups of plant 
cells organized into structural and/or functional units. The use of this term in conjunction with, 
or in the absence of, any specific type of plant tissue as listed above or otherwise embraced by 

10 this definition is not intended to be exclusive of any other type of plant tissue. 

A "promoter" is an untranslated DNA sequence upstream of the coding region that contains 
the binding site for RNA polymerase n and initiates transcription of the DNA. The promoter 
region may also include other elements that act as regulators of gene expression. 
A "protoplast" is an isolated plant cell without a cell wall or with only parts of the cell wall. 

15 Purified: the term "purified," when applied to a nucleic acid or protein, denotes that the nucleic 
acid or protein is essentially free of other cellular components with which it is associated in the 
natural state. It is preferably in a homogeneous state although it can be in either a dry or 
aqueous solution. Purity and homogeneity are typically detennined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high perfomaance liquid 

20 chromatography. A protein which is the predominant species present in a preparation is 

substantially purified. The term "purified" denotes that a nucleic acid or protein gives rise to 
essentially one band in an electrophoretic gel. Particularly, it means that the nucleic acid or 
protein is at least about 50% pure, more preferably at least about 85% pure, and most 
preferably at least about 99% pure. 

25 ••Regulatory elements" refer to sequences involved in controlling the expression of a nucleotide 
sequence. Regulatory elements comprise a promoter operatively linked to the nucleotide 
sequence of interest and termination signals. They also typically encompass sequences required 
for proper translation of the nucleotide sequence. 

Significant Increase: an increase in enzymatic activity that is larger than the margin of error 
30 inherent in the measurenfient technique, preferably an increase by about 2-fold or greater of the 
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activity of the wild-type enzyme in the presence of the inhibitor, more preferably an increase by 
about 5-fold or greater, and most preferably an increase by about 10-fold or greater. 
The terms "identical" or percent "identity" in the context of two or more nucleic acid or 
protein sequences, refer to two or more sequences or subsequences that are the same or have a 
specified percentage of amino acid residues or nucleotides that are the same, when compared 
and aligned for maximum correspondence, as measured using one of the following sequence 
comparison algorithms or by visual inspection. 

Substantially identical: the phrase "substantially identical." in the context of two nucleic acid or 
protein sequences, refers to two or more sequences or subsequences that have at least 60% 
preferably 80%. more preferably 90-95%. and most preferably at least 99% nucleotide or ' 
amino acid residue identity, when compared and aligned for maximum correspondence as 
measured using one of the following sequence comparison algorithms or by visual inspection 
Preferably, the substantial identity exists over a region of the sequences that is at least about 50 
residues in lengti,, more preferably over a region of at least about 100 residues, and most 
preferably the sequences are substantially identical over at least about 150 residues. In a most 
preferred embodiment, the sequences are substantially identical over the entire length of the 
coding regions. Furthermore, substantially identical nucleic acid or protein sequences perform 
substantially tb& same function. 

For sequence comparison, typically one sequence acts as a reference sequence to which test 
sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are input into a computer, subsequence coordinates are designated if necessary and 
sequence algorithm program parameters are designated. The sequence comparison algorithm 
then calculates the percent sequence identity for the test sequence(s) relative to ti.e reference 
sequence, based on the designated program parameters. 

Optimal alignment of sequences for comparison can be conducted. e.g.. by the local homology 
algonthm of Smiti, & Waterman. Adv. Appl. Math. 2: 482 ( 1981). by the homology alignment 
algonthm of Needleman & Wunsch. J. Mol. Biol. 48: 443 (1970). by the search for similarity 
metiiod of Pearson & Lipman. Proc. Nat,. Acad. Sci. USA 85: 2444 (1988), by computerized 
UT,,lementations of these algorithms (GAP. BESTHT. FASTA. and TFASTA in the Wisconsin 
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Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by 
visual inspection (see generally, Ausubel et al, infra). 

One example of an algorithm that is suitable for determining percent sequence identity and 
sequence similarity is the BLAST algorithm, which is described in Altschul et al, J, Mol Biol. 
5 215: 403-410 (1990). Software for performing BLAST analyses is publicly available through 
the National Center for Biotechnology Information (http://www.ncbi.nhn.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either, n^tch or satisfy some positive- valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
10 referred to as the neighborhood word score threshold (Altschul et al, 1990). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are then extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, for 
nucleotide sequences, the paranneters M (reward score for a pair of matching residues; always 
15 > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a 
scoring matrix is used to calculate the cumulative score. Extension of the word hits in each 
direction are halted when the cumulative alignment score falls off by the quantity X from its 
maximum achieved value, the cumulative score goes to zero or below due to the accumulation 
of one or more negative-scoring residue alignments, or the end of either sequence is reached. 
20 The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the 
alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength 
(W) of 1 1, an expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both 
strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 
3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see HenikofF & Henikoff, 
25 Proc. NatL Acad. ScL USA 89: 10915 (1989)). 

In addition to calculating percent sequence identity, the BLAST algorithm also performs a 
statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. 
Natt Acad. Sci, USA 90: 5873-5787 (1993)). One measure of similarity provided by the 
BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the 
30 probability by which a match between two nucleotide or annino acid sequences would occur by 
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Chance. For example, a test nucleic acid sequence is considered sin^ar to a^eference sequence 
If the smallest sum probability in a comparison of the test nucleic acid sequence to the 
reference nucleic acid sequence is less than about 0.1. more preferably less than about 0.01. 
and most preferably less than about 0.(X)I. 

Aoother Mication ftat ,wo mclcic acid seqacK^s a« substandally idcticaj fta, the two 
moleoute hybridize to each other under stringent conditions. TV phrase "hybridizing 
^ificaily to" refers to the binding, duplexing, or hybridizing of a molecule only to a 
pa^cular nucleotide sequence under stringent condiUons when that sequence is present to a 
con,lex mixture (e.g.. total celtalar) DNA or RNA. •Bind(s) substantially" refers to 
con^lementary hyWdization between a probe nucleic acid and a target nucleic acid and 
ernhnK^es ndnor nusmatehes a,at can be accommodated by reducing the stringency of the 
bybndizaUon media to achieve the desired detection of the target nucleic acid sequence 
"Stringent hybridization conditions" and "stringent hybridization wash conditions" in the 
context of nucleic acid hybridizaUon experim«Ks such as Southern and Northern hybridizations 
are sequence dependent, and are different under diffaent enviromnental parameter.. Longer 
sequences hybridize speciiically a. higher tempetan^. An extensive guide to a» hybridization 
Of nucleic acids is found in Tijssen (1993) LaBor^ Techniques in Biochemist^ w 
MoUcularBiology-Hybridl^n Nuclei. Acid Probes par, I chapter 2 "Overview of 
pr^ciples of hybridization a^i the strategy of nucleic acid probe assays" Elsevier. New York 
Generally, highly stringent hybridization and wash conditions are selected to be about 5-C 
lower than the U,en„al melttog point (T J for the specific sequence at a defined ionic strength 
and pa TypicaUy. under "stiingent conditions" a probe will hybridize to its target 
subsequence, but to no other sequences. 

The T. is the temperan.re (under defined ionic strength and pH) at which 30% of ti„ target 
sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be' 
equal to ^ T„ for a particular probe. An example of stringent hybridization conditions for 
hybrxlization of eomptanentary nucleic acids which have more U«n 100 complementary 
residues on a filter in a Southern or „ort>em blot is 50% fonnamide with 1 mg of heparin at 
42"C, w«h the hybridization being carried out overnight. An example of highly stringent wash 
co^htions is 0.1 5M Naa a. 7rc for alx>„t 15 mtoutes. An example of stringent wash 
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conditions is a 0.2x SSC wash at 65'*C for 15 minutes {see, Sambrook, infra, for a description 
of SSC buffer). Often, a high stringency wash is preceded by a low stringency wash to remove 
background probe signal. An example medium stringency wash for a duplex of, e.g., more than 
100 nucleotides, is Ix SSC at 45**C for 15 minutes. An example low stringency wash for a 

5 duplex of, e.g., more than 100 nucleotides, is 4-6x SSC at 40''C for 15 minutes. For short 
probes {e.g., about 10 to 50 nucleotides), stringent conditions typicaDy involve salt 
concentrations of less than about l.OM Na ion, typically about 0.01 to 1.0 M Na ion 
concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 
30**C. Stringent conditions can also be achieved with the addition of destabilizing agents such 

10 as formamide. In general, a signal to noise ratio of 2x (or higher) than that observed for an 
unrelated probe in the particular hybridization assay indicates detection of a specific 
hybridization. Nucleic acids that do not hybridize to each other under stringent conditions are 
stiD substantially identical if the proteins that they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 

15 permitted by the genetic code. 

The following are exmnples of sets of hybridization/wash conditions that may be used to clone 
homologous nucleotide sequences that are substantially identical to reference nucleotide 
sequences of the present invention: a reference nucleotide sequence preferably hybridizes to 
the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM 

20 EDTA at 50°C with washing m 2X SSC. 0. 1 % SDS at 50**C, more desirably in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04. 1 mM EDTA at 50^C with washing in IX SSC, 0.1% 
SDS at SO'^C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM 
EDTA at 50X with washing in 0.5X SSC, 0.1% SDS at 50**C, preferably in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at SO^'C with washing in O.IX SSC, 0.1% 

25 SDS at 50**C, more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM 
EDTA at 50°C with washing in 0. IX SSC, 0. 1 % SDS at 65°C. 

A further indication that two nucleic acid sequences or proteins are substantially identical is 
that the protein encoded by the first nucleic acid is immunologically cross reactive with, or 
specifically binds to, the protein encoded by the second nucleic acid. Thus, a protein is typically 
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substantiany identical to a second protein, for example, where the two proteins differ only by 
conservative substitutions. 

The phrase "specificaDy (or selectively) binds to an antibody." or "specifically (or selectively) 
immunoreactive with," when refexxing to a protein or peptide, refers to a binding reaction 
which is determinative of ti^e presence of the protein in the presence of a heterogeneous 
population of proteins and other biologies. Thus, under designated ixnmunoassay conditions 
the specified antibodies bind to a particular protein and do not bind in a significant amount to 
other proteins present in the sample. Specific binding to an antibody under such conditions may 
requ^e an antibody that is selected for its specificity for a particular protein. For example 
antibodies raised to the protein with the amino acid sequence encoded by any of the nucleic 
aad sequences of tiie invention can be selected to obtain antibodies specifically 
iinmunoreactive with that protein and not with other proteins except for polymorphic variants. 
A variety of unmunoassay formats may be used to select anhlx>dies specifically 
ixnmunoreactive with a particular protein. For example, solid-phase EUSA immunoassays 
Western blots, or immunohistochemistiy are routinely used to select monoclonal antibodies 
specifically iirnnunoreactive with a protein. See Harlow and Lane (1988) Antibodies A 
iMboratory Manual. Cold Spring Harbor Publications. New York "Harlow and Lane") for a 
description of immunoassay formats and conditions that can be used to determine specific 
immunoreactivity. Typicafly a specific or selective reaction will be at least twice background 
signal or noise and more typically more than 10 to 100 times background. 
"Conservatively modified variations" of a particular nucleic acid sequence refers to tiiose 
nucleic acid sequences that encode identical or essentially identical amino acid sequences or 
where the nucleic acid sequence does not encode an amino acid sequence, to essentially 
Identical sequences. Because of Uie degeneracy of the genetic code, a large number of 
fiinctionally identical nucleic acids encode any given polypeptide. For instance the codons 
CGT, CGC. CGA. CGG. AGA. and AGG all encode the amino acid arginine. Thus, at every 
position where an arginine is specified by a codon. the codon can be altered to any of the 
corresponding codons described without altering the encoded protein. Such nucleic acid 
vanations are "silent variations" which are one species of "conservatively modified variations " 
Every nucleic acid sequence described herein which encodes a protein also describes evei^ 
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possible silent variation, except where otherwise noted. One of skill will recognize that each 
codon in a nucleic acid (except ATG, which is ordinarily the only codon for methionine) can be 
modified to yield a functionally identical molecule by standard techniques. Accordingly, each 
"silent variation" of a nucleic acid which encodes a protein is implicit in each described 
5 sequence. 

Furthermore, one of skill will recognize that individual substitutions deletions or additions that 
alter, add or delete a single amino acid or a small percentage of amino acids (typically less than 
5%, more typically less than 1%) in an encoded sequence are "conservatively modified 
variations," where the lEiIterations result in the substitution of sin amino acid with a chemically 

10 similar amino acid. Conservative substitution tables providing functionally similar amino acids 
are weD known in the art. The following five groups each contain amino acids that are 
conservative substitutions for one another: Aliphatic: Glycine (G), Alanine (A), Valine (V), 
Leucine (L), Isoleucine (I); Aromatic: Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 
Sulfiir-containing: Methionine (M), Cysteine (C); Basic: Arginine (R), Lysine (K), Histidine 

15 (H); Acidic: Aspartic acid (D), Glutamic acid (E), Asparagine (N), Ghitamine (Q), See also, 
Creighton (1984) Proteins, W.H. Freeman and Conipany. In addition, individual substitutions, 
deletions or additions which alter, add or delete a single amino acid or a small percentage of 
amino acids in an encoded sequence are also "conservatively modified variations." 
A "subsequence" refers to a sequence of nucleic acids or amino acids that comprise a part of a 

20 longer sequence of nucleic acids or amino acids (e.g., protein) respectively. 

Nucleic acids are "elongated" when additional nucleotides (or other analogous molecules) are 
incorporated into the nucleic acid. Most commonly, this is performed with a polymerase (e.g., 
a DNA polymerase), e.g., a polymerase which adds sequences at the 3' terminus of the nucleic 
acid. 

25 Two nucleic acids are "recombined" when sequences from each of the two nucleic acids are 
combined in a progeny nucleic acid. Two sequences are "directly" recombined when both of 
the nucleic acids are substrates for recombination. Two sequences are "indirectly recombined" 
when the sequences are recombined using an intermediate such as a cross-over 
oligonucleotide. For indirect recombination, no more than one of the sequences is an actual 
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substrate for recombination, and in some cases, neither sequence is a substrate for 
recombination. 

"Synthetic" refers to a nucleotide sequence coniprising structural characters that are not 
present in the natural sequence. For example, an artificial sequence that resembles more closely 
the G+C content and the normal codon distribution of dicot and/or monocot genes is said to be 

synthetic. 

A "transactivator" is a protein which, by itself or in combination with one or more additional 
proteins, is capable of causing transcription of a coding region under control of a 
corresponding transactivator-mediated promoter. Examples of transactivator systems include 
phage T7 gene 10 promoter, the transcriptional activation of which is dependent upon a 
specific RNA polymerase such as the phage T7 RNA polymerase. The transactivator is 
typically an RNA polymerase or DNA binding protein capable of interacting with a particular 
promoter to initiate transcription, either by activating the promoter directly or by inactivating a 
repressor gene, e.g., by suppressing expression or accumulation of a repressor protein. The 
DNA binding protein may be a chimeric protein conprising a binding region (e.g., the GAL4 
bindmg region) linked to an appropriate transcriptional activator domain. Some transactivator 
systems may have multiple transactivators, for example promoters which require not only a 
polymerase but also a specific subunit (sigma factor) for promotor recognition. DNA binding 
or transcriptional activation. The transactivator is preferably heterologous with respect to thi 
plant. 

Tr^fomiation: a process for introducing heterologous DNA into a cell, tissue, or insect 
Transfom^d cells, tissues, or insects are understood to encompass not orfy the end product of 
a transformation process, but also transgenic progeny thereof 

•Transformed," "transgenic." and "recombinant" refer to a host organism such as a bacterium 
or a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic 
acid molecule can be stably integrated into the genome of the host or the nucleic acid molecule 
can also be present as an extrachromosomal molecule. Such an extrachromosomal molecule 
can be auto-replicating. Transformed cells, tissues, or plants are understood to encompass not 
only the end product of a transfomiation process, but also transgenic progeny thereof A "r 
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• transformed," "non-transgenic," or "non-recombinant" host refers to a wild-type organism, 
e.g., a bacterium or plant, which does not contain the heterologous nucleic acid molecule. 
Nucleotides are indicated by their bases by the following standard abbreviations: adenine (A), 
cytosine (C), thymine (T), and guanine (G). Amino acids are likewise indicated by the 

5 following standard abbreviations: alanine (Ala; A), arginine (Arg; R), asparagine (Asn; N), 
aspartic acid (Asp; D), cysteine (Cys; Q, glutamine (Gin; Q), glutamic acid (Glu; E), glycine 
(Gly; G), histidine (His; H), isoleucine (He; I), leucine (Leu; L), lysine (Lys; K), methionine 
(Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), 
tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V). Furthermore, (Xaa; X) represents 

10 any amino acid. 

Wet milling 

Wet milling is a process of separating the starch, protein and oil conponents of grain, most 
often cereals, for example com. It is distinguished herein from dry milling, which is simply 

15 pulverizing grain. Com wet milling is comprised of the steps of steeping, grinding the com 
kernel and separating the coii5)onehts of the kernel. The first step in wet milling is usually 
steeping, wherein the grain is soaked in water under carefully controlled conditions to soften 
the kernels and facilitate separation of the components. The kernels are typically steeped in a 
steep tank with a countercurrent flow of water at about 120''F containing sulfur dioxide at a 

20 concentration of about 0.2% by weight. The kernels remain in the steep tank from about 24 to 
48 hours. The kernels are then dewatered and subjected to sets of attrition type mills. The first 
set of attrition type mills rupture the kernels releasing the germ, and com oil from the rest of 
the kernel. Centrifugation is used to separate the germ from the rest of the kernel. The oil- 
bearing embryos float to the surface of the aqueous solution and are removed. 

25 Next, by processes of watering and dewatering, milling, screening, centrifuging and washing, 
the starch is separated from the protein and purified. Following embryo removal, the remaining 
kernel components including the starch, hull, fiber, and gluten are subjected to another set of 
attrition mills and passed through a set of wash screens to separate the fiber components from 
the starch and gluten. The starch and gluten pass through the screens while the fiber does not. 

30 Centrifugation or a third grind foDowed by centrifugation is used to separate the starch from 
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the protein. Centriftigatjon produces a slurry containing the starch granules, which is 
dewatered. washed with fresh water and dried to about 12% moisture. The result is the 
recovery of a fraction of substantially pure starch from the com kernels in this manner. 
The key difficulty is to loosen starch granules from the complicated matrix of proteins and ceD 
wall material that makes up the endospenn of the grain. One reason for this difficulty is 
believed to be the presence of inter- or intra-molecular disulfide bonds, which render the 
protein matrix less soluble and less susceptible to proteolytic enzymes and inhibit release of the 
starch granules from the protein matrix in the grain. At present, the primary means for reducing 
these bonds is to steep the grain in the presence of sulfur dioxide, but this is costly, 
environmentaUy unfiiendly, and not optimally effective. Because the steep water contains 
sulfur dioxide it is considered toxic waste, and therefore minimizing the volume generated 
would be advantageous. Alternatively, the requirement for sulfur dioxide would be eliminated. 
Reducing the steep times that are required for grain conditioning prior to milling is an 
additional advantage of reducing the extent of disulfide bonds in the endosperm matrix. 
Certain mutations exert beneficial effects on the protein matrix of com kernel endospemi 
(floury and opaque), but impair kernel integrity. Transgenic thioredoxin expression provides 
some of these advantages without creating some of the kernel integrity problems associated 
with these mutations. 

Post-harvest or processing-dependent activities of thioredoxin have equally beneficial effects. 
For example, in one embodiment, thioredoxin and/or thioredoxin reductase enzymes are 
targeted to and accumulated in cell compartments. Protein reduction occurs following physical 
disruption of the seed. In another embodiment, quiescent endosperm thioredoxin and/or 
thioredoxin reductase is activated upon steeping. In a preferred embodiment, the invention 
provides a plant expressing a transgenic thermostable thioredoxin and thioredoxin reductase, 
e.g. a thioredoxin and thioredoxin reductase derived from a hyperthermophilic organism, such 
that the thioredoxin and thioredoxin reductase are not significantly active except at high 
temperatures (e.g., greater than 50°Q. In one embodiment, the thermostable thioredoxin and 
thioredoxin reductase are synergistic with saccharification via expression of other themiostable 
enzymes in endosperm. 
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Feed applications 

Expression of transgenic thioredoxin and/or thioredoxin reductase in grain is also useful to 
improve grain characteristics associated with digestibility, particularly in animal feeds. 
Susceptibility of feed proteins to proteases is a function of time and of protein conformation. 
5 Kernel cracking is often used in feed formulation as is steam flaking. Both of these processes 
are designed to aid kernel digestibility. Softer kernels whose integrity can be disrupted more 
easily in animal stomachs are desirable. Conformational constraints and crosslinks between 
proteins are major determinants of protease susceptibility. Modifying these bonds by increased 
thioredoxin and/or thioredoxin reductase expression thereby aids digestion. 

10 

Com dry milling/masa 

Protein content and quality are important determinants in flaking grit production and in masa 
production. Reduction of disulphide bonds alters the nature of com flour such that it is suitable 
for use as a wheat substitute, especially flours made from high-protein white com varieties. 

15 

Sovbean crushing 

Over half of the US soybean crop is crushed or milled, and the protein quality in the resulting 
low-fat soy flour or de-fatted soy flour (or soybean meal) is important for subsequent 
processing. Protein yield and quality from soybean processing streams are economically 

20 important, and are largely dependent upon protein conformation. Increasing thioredoxin 

activity through expression of transgenic thioredoxin and/or thioredoxin reductase increases 
protein solubility, and thus increases yield, in the water-soluble protein fractions. Recovery is 
facilitated by aqueous extraction of de-fatted soybean meal under basic conditions. Enhancing 
thioredoxin activity through expression of transgenic thioredoxin and/or thioredoxin reductase 

25 also reduces the required pH for efficient extraction and thereby reduces calciuni or sodium 
hydroxide inputs, as well as lowering the acid input for subsequent acid precipitation, aUowing 
efficient recovery of proteins without alkali damage, and reducing water consumption and 
processing plant waste effluents (that contain substantial biological oxygen demand loads). 
Protein redox status affects innportant functional properties supplied by soy proteins, such as 

30 solubility, water absorption, viscosity, cohesion/adhesion, gelation and elasticity. Fiber removal 
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during soy protein concentrate production and soy protein isolate hydrolysis by proteases is 
enhanced by increasing thioredoxin activity as described herein. Similarly, as described for com 
above, increasing thioredoxin activity through expression of transgenic thioredoxin and/or 
thioredoxin reductase enhances the functionality of enzyme-active soy flours and the 
digestibility of the soybean meal fraction and steam flaking fraction in animal feeds. 
Modification of protein quality during seed development and during processing are both 
provided, although it is preferred that the transgenic thioredoxin and/or thioredoxin reductase 
be targeted to a cell compartment and be thermostable, as described above, to avoid significant 
adverse effects on storage protein accumulation possibly encountered as a result of thioredoxin 
activity during seed development. Alternately, the thioredoxin may be added as a processing 
enzyme, as (in contrast to com wet milling) breaking the disulphide bonds is not necessary 
until after grain integrity is destroyed (crushing and oil extraction). 

Selection Qf thioredoxin and thioredoxin r^^ u cta.<;e for h^t>.roiogo„.. ftyp r^.^cinn 
Thioredoxin, thioredoxin reductase and protein disulfide isomerase (PDI) genes are found in 
eukaiyotes including plants, eubacteria as well as archea. including hyperthermophilic 
organisms such as Methanococcus jannaschn and Archaeoglobus fulgidus. Selection of a 
particular gene depends in part on the desired application. For the methods of the present 
invention, preferred thioredoxins have the foDowing characteristics: 

1. Heat stability - Thioredoxin and related proteins from hyperthermophiles are found to have 
increased stability at high temperatures (>50 °C) and relatively low activity at ambient 
temperatures. Expression of thioredoxin and/or thioredoxin reductase from hyperthermophiles, 
for example from archea such as Methanococcus jannaschii and Archaeoglobus fulgidus or 
other hyperthemnophiles is preferred for expression during seed development, so that the 
thioredoxin activity is not markedly increased untfl the grain is steeped or processed at elevated 
temperature. Most grain processing methods involve, or are compatible with, a high 
temperature step. Thermostable thioredoxin and thioredoxin reductase are therefore preferred. 
By thermostable is meant that the enzyme is preferentiaDy active at high temperatures, e.g.. 
temperatures greater than 40°C, most preferably greater than 50°C, e.g. 45-60°C for wet" ' 
mflling, or even higher, e.g., 45-95°C. 
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2. Substrate specificity - It is also possible to reduce undesirable effects on seed development 
by selection of a thioredoxin that acts preferentiaDy on certain proteins such as the structural 
protein in the matrix and has low activity with essential metabolic enzymes. Various 
thioredoxins have been shown to differ in reactivity with enzymes that are under redox control. 
5 Thus it is possible to select a thioredoxin that will primarily act on the desired targets, 
minimizing undesirable side-effects of over expression. 

Suitable thermostable thioredoxins and thioredoxin reductases include the following: 

• amino acid sequence of thioredoxin from Methanococcus jannaschii (gill 59 1029) 

MSKVKIELFTSPMCPHCPAAKRVVEEVANEMPDAVEVEYINVME^ 
10 DVEFIGAPTKEAIiVEAIKKRL (SEQ ID NO: 1); 

• amino acid sequence of thioredoxin from Archaeoglobus fulgidus (gil2649903)(trx- 1) 
MPMVRKAAFYAIAVISGVLAAWGNALYHNFNSDLGAQAKIYFFYSDSCPHCR^ 
THNLTWCWAEMDANCSKIAQEFGIKYVPTLVIl^EEAHVFVGSDEV^ (SEQ ID 
NO:2); 

15 • amino acid sequence of thioredoxin from Archaeoglobus fulgidus (gil2649838) (trx-2) 

MVFTSKYCPYCRAFEKVVERLMGELNGTVEFEVVDVDEKREIJ^ 
GFMGFADYKTAREAILEQISAFLKPDYKN (SEQ ID NO:3); 

• amino acid sequence of thioredoxin frpm Archaeoglobus fulgidus (gil2649295) (trx-3) 
MDELELIRQKKLKEmQIOtfSGEEKARKVljDSPVia^SSN^ 

20 IAPVIEEIJ\KEYAGKVVFGKLNTDENPTIAARYGI SAIPTLIFFKKGKPVDQI1VGAMPKSEI.KR 

WVQRNL (SEQ ID NO:4); 

• amino acid sequence of thioredoxin from Archaeoglobus fulgidus {g^\264S3^9) (trx-4) 

MERI^SERFREVIQSDKLVVVDFYADWa^PCRYI SPILEKLSKEYNGEVEFYKI^^ 
EYGIAS IPTVLFFRNGKWGGFIGAMPESAVRAEIEKALGA (SEQ ID NO:5); 
25 • amino acid sequence of thioredoxin reductase (trxB) from Metlianococcus jannaschii 
(gill592167) 

MIHDTIIIGAGPGGLTAGIYAbffiGKI^ALCIEKENAGGRIAEAGIVENYPGFEEIRGYELAEKF 
KNHAEKFKLPI I YDEVIKIETKERPFKVI TKNSEYLTKTIVIATGTKPKKLGLNEDKFIGRG I S . 
YCTMCDAFFYIJSrKEVIVIGRDTPAIMSAINLKDIAKKVIVITDKSEIjKA^ 
30 EI I YNAKPLEI VGEERAEGVKI SVNGKEEI IKADGIFISLGHVPNTEFLKDSGIELDKKGFIKT 

DENCRTNIDGIYAVGDVRGGVMQVAKAVGDGCYAMANIIKYLQKL (SEQ ID NO:6); and 
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. amino acid sequence of thioredoxin reductase from Archaeoglobusfulgidus (gil2649006) 
(trxB) 

MyDVAIIGGGPAGLTAALYSARYGLKTVFFETVDPVSQLSI^IENYPGFEGSGMELLEKMKE 
QAVKAGAEWKLEKVERVEimGETFTVIAEGGEYEAKAIIVATGGKHKEAGIEGESAFIG^^^^^ 

CATCDGNFFRGKKVIVyGSGKEAIEDAIYLHDIGCEVTIVSRTPSFRAEKALVEEVEKRGIPVH 
YSTTII^IIGSGKVEKVVAYNREKKEEFEIEADGIFVAIGMRPATDWAE^^ 

KEQRTNVEGVFAAGDCCDNPLKQWTACGDGAVAAYSAYKYLTS (SEQ JD NO:7). 

The genes that encode these proteins for use in the present invention are preferably designed 
by back-translation using plant preferred codons, to enhance G-C content and remove 
detrimental sequences, as more fully described below. The activity of the proteins may be 
enhanced by DNA shuffling or other means, as described below. The invention therefore 
comprises proteins derived from these proteins, especially proteins which are substantially 
similar which retain thioredoxin or thioredoxin reductase activity. 
For engineering thioredoxin expression in seeds for activity during grain development 
promoters which direct seed-specific expression of thioredoxin and thioredoxin reductase are 
preferred, as is targeting to the storage so that the enzyme will have the desired effects on 
storage proteins, which may be desirable in some applications. In the present invention 
however, it is more generally desirable to engineer thioredoxin and/or thioredoxin reductase 
expression m seeds for accumulation and inactivity during grain development. Several 
strateg.es are employed to create seeds that express transgenic thioredoxin and/or thioredoxin 
reductase without having a significant impact on normal seed development, e.g.: 
(i) To compartmentalize active thioredoxin or thioredoxin reductase suJh that it does not 
significantly interact with the target proteins, for example by targeting to or expression in 
amyloplasts. Plastid targeting sequences are used to direct accumulation in the amyloplast 
Alternatively, the thioredoxin and/or thioredoxin reductase is targeted to an extracellular 
location in cell walls using secretion signals. Or finally, in the case of monocots. expression in 
cell types such as aleurone during seed development is used to keep die thioredoxin and/or 
thioredoxin reductase away from the storage components of the rest of die endosperm 
in) To engmeer the expression of thioredoxin and/or thioredoxin reductase from 
thermophilic organisms. Enzymes which have little or no activity at ambient ten^eratures (as 
high as 38-39°C in the field) are less likely less likely to cause problems during development. 
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Preferably, therefore, the enzymes are active primarily at high temperatures, e.g., temperatures 
greater than 40°C, most preferably 45-60''C for wet milling, or even higher, e.g., 45-95*'C. 

(iii) To place the thioredoxin and/or thioredoxin reductase under control of an inducible 
promoter, for example a chemically-inducible promoter, a wound inducible promoter, or a 

5 transactivator mediated promotor which is activated upon pollination by a plant expressing the 
transactivator, 

(iv) To utilize thioredoxin having specific requirements for a particular thioredoxin reductase, 
such that activity of the thioredoxin or thioredoxin reductase is suitably regulated via 
availability of the appropriate thioredoxin reductase or thioredoxin, respectively. For example, 

10 the thioredoxin and thioredoxin reductase are expressed in different plants, so that the active 
combination is only available in the seed upon pollination by the plant expressing the 
complimentary enzyme. Alternatively, the thioredoxin or thioredoxin reductase is sequestered 
in the cell, for example in a plastid, vacuole, or apoplast, as described above, so that it does not 
become available until the grain is processed. 

15 

Methods of grain processing 

The invention thus provides a novel method of enhancing separation of the starch from the 
protein matrix, using thioredoxin and/or thioredoxin reductase. In a first embodiment, 
thioredoxin activity is found to be useful in a variety of seed processing applications, including 

20 wet milling, dry nulling, oilseed processing, soybean processing, wheat processing and 
flour/dough quality, most especially the wet milling of grains, in particular com. 
Accordingly, the invention provides a method to improve milling efficiency or increase milling 
yield, to increase efficiency of separation of starch and protein, to enhance yields of starch and 
soluble proteins from grain, or to enhance increase protein solubility in water or other solvents, 

25 comprising steeping grain in the presence of supplemental thioredoxin and/or thioredoxin and 
separating the starch and protein components of the grain. Typically, steeping occurs before 
milling, but may occur afterwards, and there may be more than one milling or steeping step in 
the process method extraction and increase protein yield from seeds during the steep or points 
after steeping. Preferably, the supplemental thioredoxin and/or thioredoxin reductase is 

30 provided by expression of a transgene in the plant from which the grain is harvested. 

- 24 - 



BNSDOC1D: <WO ^0198509A2_L> 



wo 01/98509 

PCT/EPOl/06918 



The invention fiirther provides: the use of thioredoxin or thioredoxin reductase m a method to 
improve milling efficiency or increase milling yield of starches or proteins, for example in any 
of the methods described above, steepwater comprising an amount of thioredoxin and/or 
thioredoxin reductase effective to fecilitate separation of starch from protein in grain; grain 
which has been exposed to thioredoxin an amount effective to facilitate separation of starch 
from pibtein; and starch or protein which has been produced by the method described above 
The activity of the thioredoxin in the above method may be enhanced by supplementing the 
steepwater with thioredoxin reductase and/or NADPH. Other components nomially present in 
steepwater for wet milling may also be present, such as bacteria which produce lactic acid. 
Preferably, the steeping is carried out at a temperature of about 52°C for aperiod of 22-50 
hours, so it is desirable that the thioredoxin is stable under tiiese conditions. 
The grain may be a dicotyledonous seed, for example, an oil seed. e.g.. soybean, sunflower or 
canola, preferably soybean; or may be a monocotyledonous seed, for example a cereal seed. 
e.g., com, wheat, oats, barley, rye or rice, most preferably com. 

The Uuoredoxin may be any protein bearing thiol groups which can be reversibly oxidized to 
fom, disulfide bonds and reduced by NAPDH in the presence of a thioredoxin reductase 
Preferably the tiiioredoxin is derived from a tiiermophilic organism, as described above 
Thioredoxin and/or thioredoxin reductase for use in ti^e instant invention is suitably produced 
in an engmeered microbe, e.g. a yeast or aspergilles. or in an engineered plant capable of very 
high expression, e.g. in barley, e.g., under control of a promoter active during malting such as 
a high pi alpha-amylase promoter or other gibberellin dependent promoter. The thioredoxin (in 
excreted or extracted fom. or in combination witi. the producer organism or parts ther^f) is 
then added to the steepwater. 

As an alternative or supplement to adding the tiuoredoxin to the steepwater. the enzyme can be 
expressed directly in the seed that is to be milled. Preferably, the enzyme is expressed during 
gram maturation or during a conditioning process. 

Accordingly, in a further embodiment, the invention.provides amethod of making thioredoxin 
on an industrial scale in a transgenic organism, e.g., a plant, e.g., a cereal, such as barley or 
com, or a microorganism, e.g., a yeast or aspergillis. for example a metiiod comprising the 
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Steps of cultivating a transgenic organism having a chimeric gene which expresses thioredoxin, 
and optionally isolating or extracting the thioredoxin; 

A method of using transgenic plants that produce elevated quantities of thioredoxin during 
seed maturation or genuination such that the quality of the proteins in that seed are affected by 
5 the endogenously synthesized thioredoxin during seed development, or during the steeping 
process, thereby eliminating or reducing the need for conditioning with exogenous chemicals 
or enzymes prior to milling; 

A method of making transgenic plants that produce elevated quantities of thioredoxin during 
seed maturation or germination such that the quality of the proteins in that seed are affected by 
10 the thioredoxin during seed development or during the steeping process, thereby eliminating or 
reducing the need for conditioning with exogenous chemicals or enzymes prior to milling. 
A method for nulling grain that uses transgenic seed containing thioredoxin, that results in 
higher starch and soluble protein yields. 

The invention further comprises a transgenic organism having in its genome a chimeric 

15 expression cassette comprising a coding region encoding a thermostable or eukaryotic 
thioredoxin or thioredoxin reductase under operative control of a promoter* 
Preferably, the transgenic organism is a plant which expresses a thioredoxin and/or thioredoxin 
reductase in a form not naturally occurring in plants of that species or which expresses 
thioredoxin at higher levels than naturally occur in a plant of that species. Preferably, the 

20 thioredoxin is expressed in the seed during seed development, and is therefore preferably under 
control of a seed specific promoter. Optionafly, expression of the thioredoxin is placed under 
control of an inducible or transactivator-regulated promoter, so that expression is activated by 
chemical induction or hybridization with a transactivator when desired. The thioredoxin is 
suitably targeted to the vacuoles of the plant by fusion with a vacuole targeting sequence. 

25 In the present invention, thioredoxin coding sequences are fused to promoters active in plants 
and transformed into the nuclear genome or the plastid genome. The promoter is preferably a 
seed specific promoter such as the gamma-zein promoter. The promoter may alternatively be a 
chemically-inducible promoter such as the tobacco PR- la promoter; or may be a chemically 
induced transactivator regulated promoter wherein the transactivator is under control of a 

30 chemically induced promoter; however, in certain situations, constitutive promoters such as the 
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CaMV 35S or Gelvin promoter may be used. With a chemically inducible promoter, expression 
of the thioredoxin genes transformed into plants may be activated at an appropriate time by 
foliar application of a chemical inducer. 

Alternatively, the thioredoxin coding sequence is under control of a transactivator regulated 
promoter, and expression is achieved by crossing the plant transformed with this sequence with 
a second plant expressing the transactivator. In a preferred form of this method, the first plant 
containing the thioredoxin coding sequence is the seed parent and is male sterile, while the 
second plant expressing the transactivator is the pollinator. Expression of thioredoxin in seeds 
is achieved by interplanting the first and second plants, e.g., such that the first plant is 
pollinated by the second and thioredoxin is expressed in the seeds of the first plant by 
activation of the transactivator regulated promoter with the transactivator expressed by the 
transactivator gene from the second parent. 

The nucleic acid sequences described in this application can be incorporated into plant cells 
using conventional recombinant DNA technology. Generally, this involves inserting a coding 
sequence of the invention into an expression system to which the coding sequence is 
heterologous (i.e., not normally present) using standard cloning procedures known in the art 
The vector contains the necessary elements for the transcription and translation of the inserted 
protein-coding sequences. A large number of vector systems known in the art can be used 
such as plasmids. bacteriophage viruses and other modified viruses. Suitable vectors include, 
but are not limited to. viral vectors such as lambda vector systems XgtU, XgtlO and Charon I 
plasmid vectors such as pBI121, pBR322. pACYC]77. pACYCl84. pAR series pKK223-3 ' 
PUC8, PUC9. PUC18, PUC19, pLG339, pRK290, pKC37, pKClOl. pCDNAH; and other ' 
similar systems. The components of the expression system may also be modified to increase 
expression. For example, truncated sequences, nucleotide substitutions or other modifications 
may be employed. The expression systems described herein can be used to transfonn virtuaUy 
any crop plant ceD under suitable conditions. Transformed cells can be regenerated into whole 
plants such that the nucleotide sequence of the invention is expressed in the transgenic, plants 
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Modification of Coding Sequences and Adjacent Sequences 

The transgenic expression in plants of genes derived from microbial sources may require the 
modification of those genes to achieve and optimize their expression in plants. In particular, 
bacterial ORFs which encode separate enzymes but which are encoded by the same transcript 
5 in the native microbe are best expressed in plants on separate transcripts. To achieve this, each 
microbial ORF is isolated individually and cloned within a cassette which provides a plant 
promoter sequence at the 5' end of the ORF and a plant transcriptional terminator at the 3' end 
of the ORF. The isolated ORF sequence preferably includes the initiating ATG codon and the 
terminating STOP codon but may include additional sequence beyond the initiating ATG and 
10 the STOP codon. In addition, the ORF may be truncated, but stiU retain the required activity; 
for particularly long ORFs, tmncated versions which retain activity may be preferable for 
expression in transgenic organisms. By "plant promoter" and "plant transcriptional terminator" 
it is intended to mean promoters and transcriptional terminators which operate within plant 
cells. This includes pronK>ters and transcription terminators which may be derived from non- 
15 plant sources such as viruses (an example is the Cauliflower Mosaic Virus). 

In some cases, modification to the ORF coding sequences and adjacent sequence is not 
required. It is sufficient to isolate a fragment containing the ORF of interest and to insert it 
downstream of a plant promoter. For example, Gaffney et ah (Science 261 : 754-756 (1993)) 
have expressed the Pseudomonas nahG gene in transgenic plants under the control of the 
20 CaMV 35S promoter and the CaMV tinl terminator successfully without modification of the 
coding sequence and with x bp of the Pseudomonas gene upstream of the ATG still attached, 
and y bp downstream of the STOP codon still attached to the nahG ORF. Preferably as little 
adjacent microbial sequence should be left attached upstream of the ATG and dovmstream of 
the STOP codon. In practice, such construction may depend on the availability of restriction 
25 sites. 

In other cases, the expression of genes derived from microbial sources may provide problems 
in expression. These problems have been well characterized in the art and are particularly 
common with genes derived from certain sources such as Bacillus. These problems may apply 
to the nucleotide sequence of this invention and the modification of these genes can be 
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undertaken using techniques now well known in the art. The following problems may be 

encountered: 

1. Codon Usage. 

The preferred codon usage in plants differs from the preferred codon usage in certain 
microorganisms. Con^aiison of the usage of codons within a cloned microbial ORF to usage 
in plant genes (and in particular genes from the target plant) wiD enable an identification of the 
codons within the ORF which should preferably be changed. Typically plant evolution has 
tended towards a strong preference of the nucleotides C and G in the third base position of 
monocotyledons, whereas dicotyledons often use the nucleotides A or T at this position. By 
modifying a gene to incorporate preferred codon usage for a particular target transgenic 
species, many of the problems described below for GOAT content and illegitimate splicing will 
be overcome. 

2. GC/AT Content. 

Plant genes typically have a GC content of more than 35%. ORF sequences which are rich in A 
and T nucleotides can cause several problems in plants. Firstly, motifs of ATITA are believed 
to cause destabilization of messages and are found at the 3' end of many short-lived mRNAs. 
Secondly, the occurrence of polyadenylation signals such as AATAAA at inappropriate 
positions within the message is believed to cause premature truncation of transcription. In 
addition, monocotyledons may recognize AT-rich sequences as splice sites (see below). 

3. Sequences Adjacent to the Initiating Methionine. 

Plants differ from microorganisms in that their messages do not possess a defined ribosome 
binding site. Rather, it is believed that ribosomes attach to the 5' end of the message and scan 
for the first available ATG at which to start translation. Nevertheless, it is believed that there is 
a preference for certain nucleotides adjacent to the ATG and that expression of microbial genes 
can be enhanced by the inchision of a eukaryotic consensus translation initiator at the ATG 
aontech (1993/1994 catalog, page 210) have suggested one sequence as a consensus 
translation initiator for the expression of the E. coli uidA gene in plants. Further. Joshi (NAR 
15: 6643-6653 (1987)) has compared many plant sequences adjacent to the ATG and suggests 
another consensus sequence. In situations where difficulties are encountered in the exprellion 
of mK:robial ORFs in plants, inclusion of one of these sequences at the initiating ATG may 
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improve translation. In such cases the last three nucleotides of the consensus may not be 
appropriate for inclusion in the modified sequence due to their modification of the second AA 
residue. Preferred sequences adjacent to the initiating methionine may differ between different 
plant species. A survey of 14 maize genes located in the GenBank database provided the 
5 following results: 

Position Before the Initiating ATG in 14 Maize Genes : 
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This analysis can be done for the desired plant species into which the nucleotide sequence is 
being incorporated, and the sequence adjacent to the ATG modified to incorporate the 
preferred nucleotides. 
15 4. Removal of Illegitimate Splice Sites. 

Genes cloned from non-plant sources and not optimized for expression in plants may also 
contain motifs which may be recognized in plants as 5' or 3' splice sites, and be cleaved, thus 
generating truncated or deleted messages. These sites can be removed using the techniques 
well known in the art. 

20 Techniques for the modification of coding sequences and adjacent sequences are well known in 
the art. In cases where the initial expression of a microbial ORF is low and it is deemed 
appropriate to make alterations to the sequence as described above, then the constmction of 
synthetic genes can be accomplished according to methods weD known in the art. These are, 
for example, described in the published patent disclosures EP 0 385 962 (to Monsanto), EP 0 

25 359 472 (to Lubrizol) and WO 93/07278 (to Ciba-Geigy). In most cases it is preferable to 
assay the expression of gene constructions using transient assay protocols (which are well 
known in the art) prior to their transfer to transgenic plants. 
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Construction of Plant Expression Cassettes 

Coding sequences intended for expression in transgenic plants are first assembled in expression 
cassettes behind a suitable promoter expressible in plants. The expression cassettes may also 
comprise any further sequences required or selected for the expression of the transgene. Such 
sequences include, but are not restricted to, transcription terminators, extraneous sequences to 
enhance expression such as introns, vital sequences, and sequences intended for the targeting 
of the gene product to specific organelles and cell compartnients. These expression cassettes 
can then be easily transferred to the plant transformation vectors described below. The 
following is a description of various components of typical expression cassettes. 
1 . Promoters 

The selection of the promoter used in expression cassettes will determine the spatial and 
temporal expression pattern of the transgene in the transgenic plant. Selected promoters will 
express transgenes in specific cell types (such as leaf epidermal cells, mesophyU cells, root 
cortex ceDs) or in specific tissues or organs (roots, leaves or flowers, for exan5)le) and the 
selection wiU reflect the desired location of accumulation of the gene product. Alternatively, 
the selected promoter may drive expression of the gene under various inducing conditions. 
Promoters vary in their strength, i.e., ability to promote transcription. Depending upon the host 
ceD system utilized, any one of a number of suitable promoters can be used, including the 
gene's native promoter. The following are non-limiting examples of promoters that may be 
used in expression cassettes, 
a. Constitutive Expression, the Ubiquitin Promoter: 

Ubiquitin is a gene product known to accumulate in many cell types and its promoter has been 
cloned from several species for use in transgenic plants {e.g. sunflower - Binet etal. Plant 
Science 29: 87-94 (1991); maize - Christensen et al. Plant Molec. Biol. 12: 619-632 (1989); 
and Arabidopsis - Norris et al. Plant Mol. Biol. 21:895-906 (1993)). The maize ubiquitin 
promoter has been developed in transgenic monocot systems and its sequence and vectors 
constructed for monocot transformation are disclosed in the patent publication EP 0 342 926 
(to Lubrizol). Taylor et al. (Plant CeU Rep. 12: 491-495 (1993)) describe a vector (pAHC25) 
that comprises the maize ubiquitin promoter and first intron and its high activity in ceU 
suspensions of numerous monocotyledons when introduced via microprojectile bombardment. 
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The Arabidopsis ubiquitin promoter is ideal for use with the nucleotide sequences of the 
present invention. The ubiquitin promoter is suitable for gene expression in transgenic plants, 
. both monocotyledons and dicotyledons. Suitable vectors are derivatives of pAHC25 or any of 
the transformation vectors described in this application, modified by the introduction of the 
5 appropriate ubiquitin promoter and/or intron sequences, 
b. Constitutive Expression, the CaMV 35S Promoter: 

Construction of the plasmid pCGN1761 is described in the published patent application 

EP 0 392 225 (Example 23). pCGN1761 contains the "double" CaMV 35S promoter and the 

tml transcriptional terminator with a unique EcoRI site between the promoter and the 

10 terminator and has a pUC-type backbone. A derivative of pCGN1761 is constructed which has 
a modified polylinker which includes NotI and Xhol sites in addition to the existing EcoRI site. 
This derivative is designated pCGN1761ENX. pCGN1761ENX is useful for the cloning of 
cDNA sequences or coding sequences (including microbial ORE sequences) within its 
polylinker for the purpose of their expression under the control of the 35S promoter in 

15 transgenic plants. The entire 35S promoter-coding sequence-//n2 terminator cassette of such a 
construction can be excised by Hindlll, SphI, Sail, mdXbal sites 5* to the promoter and 
Xbal, BamHI and Bgll sites 3* to the terminator for transfer to transformation vectors such as 
those described below. Furthermore, the double 35S promoter fragment can be removed by 5' 
excision with Hindlll^ SpliI, Sail, Xbal, or Pstl, and 3' excision with any of the polylinker 

20 restriction sites {EcoRI, NotI or Xhol) for replacement with another promoter. If desired, 

modifications around the cloning sites can be made by the introduction of sequences that ntiay 
enhance translation. This is particularly useful when overexpression is desired. For example, 
pCGN1761ENX may be modified by optimization of the translational initiation site as 
described in Example 37 of U.S. Patent No. 5,639,949. 

25 c. Constitutive Expression, the Actin Promoter: 

Several isoforms of actin are known to be expressed in most cell types and consequently the 
actin promoter is a good choice for a constitutive promoter. In particular, the promoter from 
the rice ActI gene has been cloned and characterized (McElroy et al Plant CeD 2: 163-171 
(1990)). A 1.3kb fragment of the promoter was found to contain all the regulatory elements 

30 required for expression in rice protoplasts. Furthermore, numerous expression vectors based 
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on the ActI promoter have been constructed specificaUy for use in monocotyledons (McElroy 
et al Mol. Gen. Genet. 231: 150-160 (1991)). These incorporate the Acf7-intron 1 . AdhI 5' 
flanking sequence and >i JW-intron 1 (from the maize alcohol dehydrogenase gene) and 
sequence from the CaMV 35S promoter. Vectors showing highest expression were fosions of 
35S and ActI intron or the ActI 5' flanking sequence and the ActI intron. Optimization of 
sequences around the initiating ATG (of the GUS reporter gene) also enhanced expression. 
The promoter expression cassettes described by McEIroy et al. (Mol. Gen. Genet. 231: 150- 
160 (1991)) can be easily modified for gene expression and are particularly suitable for use b 
monocotyfedonous hosts. For example, promoter-containing fragments is removed from the 
McElroy constructions and used to replace the double 35S promoter in pCGN176lENX. 
which is then available for the insertion of specific gene sequences. The fusion genes thus' 
constructed can then be transferred to appropriate transformation vectors. In a separate report, 
the rice ActI promoter with its first intron has also been found to direct high expression in 
cultured barley cells (Chibbar et al. Plant Cell Rep. 12: 506-509 (1993)). 
d. Inducible Expression, the PR-1 Promoter. 

The double 35S promoter in pCGN1761ENX may be replaced with any other promoter of 
choice that win result in suitably high expression levels. By way of example, one of the 
chemicaDy regulable promoters described in U.S. Patent No. 5,614,395 may replace the 
double 35S promoter. The promoter of choice is preferably excised from its source by 
restriction enzymes, but can alternatively be PCR-amplified using primers that cany 
appropriate terminal restriction sites. Should PCR-amplification be undertaken, then the 
promoter should be re-sequenced to check for amplification errors after the cloning of the 
amplified promoter in the target vector. The chemically/pathogen regulatable tobacco PR-la 
promoter is cleaved from plasmid pCffi 1004 (for construction, see example 21 of 
EP 0 332 104) and transferred to plasmid pCGN1761ENX (Uknes et al., 1992) pCIBl004 is 
cleaved with Ncol and the resultant 3' overhang of the linearized fragment is rendered blunt by 
treatment with T4 DNA polymerase. The fragment is then cleaved with Hindlll and the 
resultant PR-la promoter-containing fragment is gel purified and cloned into pCGN1761ENX 
from which the double 35S promoter has been removed. This is done by cleavage with Xhol 
and blunting with T4 polymerase, followed by cleavage with Hu^dIII and isolation of the larger 
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vector-terminator containing fragment into which the pCIB1004 promoter fragment is cloned. 
This generates a pCGN176]ENX derivative with the PR-1 a promoter and the tml terminator 
and an intervening polylinker with unique EcoRI and NotI sites. The selected coding sequence 
can be inserted into this vector, and the frision products {Le. promoter-gene-terminator) can 

5 subsequently be transferred to any selected transformation vector, including those described 
infra. Various chemical regulators may be employed to induce expression of the selected 
coding sequence in the plants transformed according to the present invention, including the 
benzothiadiazole, isonicotinic acid, and salicylic acid compounds disclosed in U.S. Patent Nos. 
5,523,311 and 5,614,395. 

10 e. Inducible Expression, an Ethanol-Inducible Promoter: 

A promoter inducible by certain alcohols or ketones, such as ethanol, may also be used to 
confer inducible expression of a coding sequence of the present invention. Such a promoter is 
for example the alcA gene promoter from Aspergillus niduldns (Caddick et al. (1998) Nat 
Biotechnol 16:177-180). In A. nidtdans, the alcA gene encodes alcohol dehydrogenase I, the 

15 expression of which is regulated by the AlcR transcription factors in presence of the chemical 
inducer. For the purposes of the present invention, the CAT coding sequences in plasmid 
palcA:CAT comprising a^/cA gene promoter sequence fused to a minimal 358 promoter 
(Caddick et al. (1998) Nat Biotechnol 16:177-180) are replaced by a coding sequence of the 
present invention to form an expression cassette having the coding sequence under the control 

20 of the alcA gene pronK)ter. This is carried out using methods weD known in the art. 
f. Inducible Expression, a Glucocorticoid-Inducible Promoter: 

Induction of expression of a nucleic acid sequence of the present invention using systems based 
on steroid hormones is also contemplated. For example, a glucocorticoid-mediated induction 
system is used (Aoyama and Chua (1997) The Plant Journal 11: 605-612) and gene expression 

25 is induced by application of a glucocorticoid, for example a synthetic glucocorticoid, preferably 
dexamethasone, preferably at a concentration ranging from 0. ImM to ImM, more preferably 
from lOmM to lOOmM. For the purposes of the present invention, the luciferase gene 
sequences are replaced by a nucleic acid sequence of the invention to form an expression 
cassette having a nucleic acid sequence of the invention under the control of sfac copies of the 

30 GAL4 upstream activating sequences fused to the 35S minimal promoter. This is carried out 
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using methods well known in the art. The trans-acting factor conxprises the GAL4 DNA- 
bmding domain (Keegan et al. (1986) Science 231: 699-704) fosed to the transactivating 
domain of the herpes viral protein VP16 (Triezenberg et al. (1988) Genes DeveL 2: 718-729) 
fused to the hormone-binding domain of the rat glucocorticoid receptor (Picard et al (1988) 
5 Cell54: 1073-1080). The expression of the fusion protein is controlled by any promoter 

smtable for expression in plants known in the art or described here. This expression cassette is 
also comprised in the plant co„„rising a nucleic acid sequence of the invention fused to the 
6xGAL4/minimal promoter. Tl,us, tissue- or organ-specificity of the fusion protein is achieved 
leading to inducible tissue- or organ-specificity of the insecticidal toxin. 
10 g. Root Specific Expression: 

Aether pa„™ ofgc^ expression is roo, expression. A suitable root pro«o.cr is descn^ by 
<iePra„„nd(PEBS22£.: 103-.06 (1991,) and aiso in U,e published patent application 
BP 0 452 269. This promoter is transferred to a suitable vector such as pCGN1761ENX for 
the .sertion o, a selected gene and subs«,uent transfer ofthe entite pron».er-gene-ten™„a,or 
15 cassette to a transformation vector of interest, 
h. Wound-Inducible Promoters: 

Wound-inducible promoters n«y also be suitable for gene expression. Nun»„„s such 
proHK-ters have been described (e.g. Xu e, al. Plan. Molec. Biol. 22: 573-588 (1993) 
Logetnam, « al. Plan. CeB 1: 151-,58 (1989), Rohm»ier & Lehle, Plant Molec. Bioi 22' 783 
20 792(.993).Firek«<rf. Plan. Molec. Biol. 22: 129-142 (1993), Warner «ai Plant , 3. ,9, 
20U, 993)) and all are suitable for use wid, the instant invendon. Logetnann « oL ^ ^ 

upsttean, se<p,e„ces of the dicotyledonous potato ^ gene. Xn « show that a wound- 
« propter from .he dicotyledon potato (pi„2) is active in «,e n«„„co.yled„n rice 
B.tther, Rohrmeier l.hle descrih« the cloning of the maize Wp/cDNA which is wound 
«>duced and which can be used to isolate U» cognate pt^moter using standard tech„i,„es 
S.nular.R,ek«ai andWan,er«at have described a wound-i«l„ced gene from the 
monocotyledon Asparagus afflcmaUs. which is repressed at local wound and pathogen 
mvasion sites. Using clomng techniques well known to the ar, these promoters can be 

transferred to suitable vector^: fiice/i t/% fi,« « _^ . . 

oje vectors, fused to the genes pertaining to this invention, and used to 

0 express these genes at the sites of plant wounding. 
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i. Pith-Preferred Expression: 

Patent Application WO 93/07278 describes the isolation of the maize trpA gene, which is 
preferentially expressed in pith cells. The gene sequence and promoter extending up to -1726 
bp from the start of transcription are presented. Using standard molecular biological 
5 techniques, this promoter, or parts thereof, can be transferred to a vector such as pCGN1761 
where it can replace the 35S promoter and be used to drive the expression of a foreign gene in 
a pith-preferred manner. In fact, fragments containing the pith-preferred promoter or parts 
thereof can be transferred to any vector and modified for utility in transgenic plants. 

i 

Leaf-Specific Expressioti: 

10 A maize gene encoding phosphoenol carboxylase (PEPC) has been described by Hudspeth & 
Grula (Plant Molec Biol 12: 579-589 (1989)). Using standard molecular biological techniques 
the promoter for this gene can be used to drive the expression of any gene in a leaf-specific 
manner in transgenic plants, 
k. PoUen-Specific Expression: 

15 WO 93/07278 describes the isolation of the maize calcium-dependent protein kinaise (CDPK) 
gene which is expressed in pollen cells. TTie gene sequence and promoter extend up to 1400 bp 
from the start of transcription. Using standard molecular biological techniques, this promoter 
or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 
35S promoter and be used to drive the expression of a nucleic acid sequence of the invention in 

20 a poUen-specific manner. 

2. Transcriptional Terminators 

A variety of transcriptional terminators are available for use in expression cassettes. These are 
responsible for the termination of transcription beyond the transgene and its correct 
polyadenylation. Appropriate transcriptional terminators are those that are known to function 
25 in plants and include the CaMV 35S terminator, the tml terminator, the nopaline synthase 
terminator and the pea rbcS E9 terminator. These can be used in both monocotyledons and 
dicotyledons. In addition, a gene's native transcription terminator may be used. 

3. Sequences for the Enhancement or Regulation of Expression 
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Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 
Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize AdM gene have been found to 
significantiy enhance the expression of the wild-type gene under its cognate promoter when 
introduced into maize cells. Intron 1 was found to be particularly effective and enhanced 
expression in fusion constructs with the chloramphenicol acetyltransferase gene (Callis et al 
Genes Develop. 1: 1 183-1200 (1987)). In the same experimental system, the intron from the' 
maize bronzel gene had a similar effect in enhancing expression. Intron sequences have been 
routinely incorporated into plant transformation vectors, typically within the non-translated 
leader. 

A number of non-translated leader sequences derived from viruses are also known to enhance 
expression, and these are particularly effective in dicotyledonous cells. Specifically leader 
sequences from Tobacco Mosaic Virus (TMV. the "W-sequence"). Maize Chlorotic Mottle 
Vmis (MCMV), and Alfalfa Mosaic Vims (AMV) have been shown to be effective in 
enhancing expression Gallie et al. Nucl. Acids Res. J5: 8693-871 1 (1987); Skuzeski et al. 
Plant Molec. BioL 15: 65-79 (1990)). 
4. Targeting of the Gene Product Within the Cell 

Various mechanisms for targeting gene products are known to exist in plants and the 
sequences controlling the functioning of these mechanisms have been characterized in some 
detail. For example, the targeting of gene products to the chloroplast is contK>lled by a signal 
sequence found at the amino terminal end of various proteins which is cleaved during 
chloroplast import to yield the mature protein {e.g. Comai et al J. Biol. Chem. 263: 15104- 
15109 (1988)). These signal sequences can be fused to heterologous gene products to effect 
the miport of heterologous products into the chloroplast (van den Broeck. et al. Nature 313- 
358-363 (1985)). DNA encoding for appropriate signal sequences can be isolated from the 5' 
end of the cDNAs encoding the RUBISCO protein, the CAB protein, the EPSP synthase 
enzyme, the GS2 protein and many other proteins which are known to be chloroplast localized 
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See also, the section entitled "Expression With Chloroplast Targeting" in Example 37 of U.S. 
Patent No. 5,639,949. 

Other gene products are localized to other organelles such as the mitochondrion and the 
peroxisome (e.g. Unger et al Plant Molec. Biol. i3: 41 1-418 (1989)). The cDNAs encoding 

5 these products can also be manipulated to effect the targeting of heterologous gene products to 
these organelles. Examples of such sequences are the nuclear-encoded ATPases and specific 
aspartate amino transferase isoforms for nnitochondria. Targeting cellular protein bodies has 
been described by Rogers etal (Proc. Natl. Acad. Sci. USA 82: 6512-6516 (1985)). 
In addition, sequences have been characterized which cause the targeting of gene products to 

10 other cell compartments. Amino terminal sequences are responsible for targeting to the ER, the 
apoplast, and extracellular secretion from aleurone cells (Koehler & Ho, Plant Cell 2: 769-783 
(1990)). Additionally, amino terminal sequences in conjunction with carboxy terminal 
sequences are responsible for vacuolar targeting of gene products (Shinshi et al Plant Molec. 
Biol. 14: 357-368 (1990)). 

15 By the fusion of the appropriate targeting sequences described above to transgene sequences 
of interest it is possible to direct the transgene product to any organelle or cell compartment. 
For chloroplast targeting, for example, the chloroplast signal sequence from the RUBISCO 
gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in frame to the amino 
terminal ATG of the transgene. The signal sequence selected should include the known 

20 cleavage site, and the fusion constructed should take into account any amino acids after the 
cleavage site which are required for cleavage. In some cases this requirement may be fulfilled 
by the addition of a small number of amino acids between the cleavage site and the transgene 
ATG or, alternatively, replacement of some amino acids within the transgene sequence. 
Fusions constracted for chloroplast inq>ort can be tested for efficacy of chloroplast uptake by 

25 in vitro translation of in vitro transcribed constructions foDowed by in vitro chloroplast uptake 
using techniques described by Bartlett et al In: Edelmann et al (Eds.) Methods in Chloroplast 
Molecular Biology, Elsevier pp 1081-1091 (1982) and Wasmann et al Mol. Gen. Genet. 205: 
446-453 (1986). These construction techniques are well known in the art and are equally 
applicable to mitochondria and peroxisomes. 



-38 - 



BNSDCX:iD: <WO ^01985O9A^I_> 



10 



15 



20 



WO 01/98509 

PCT/EPOl/06918 

The above-described mechanisms for cellular targeting can be utilized not only in conjunction 
with their cognate promoters, but also in conjunction with heterologous promoters so as to 
effect a specific cell-targeting goal under the transcriptional regulation of a promoter that has 
an expression pattern different to that of the promoter from which the targeting signal derives. 

Construction of Plant Transformafinn V^rtrvrc 

Numerous transformation vectors available for plant transformation are known to those of 
ordinary skill in the plant transformation arts, and the genes pertinent to this invention can be 
used in conjunction with any such vectors. The selection of vector will depend upon the 
preferred transformation technique and the target species for transformation. For certain target 
species, different antibiotic or herbicide selection markers may be preferred. Selection markers 
used routinely in transformation include the nptll gene, which confers resistance to kanamycin 
and related antibiotics (Messing & Vierra. Gene 19: 259-268 (1982); Bevan et al Nature 
3Q4: 1 84- 1 87 (1 983)). the tar gene, which confers resistance to the herbicide phosphinothricin 
(White et al.. Nucl. Acids Res IS: 1062 (1990). Spencer et al. Theor. Appl. Genet 79: 625-631 
(1990)), the hph gene, which confers resistance to the antibiotic hygromycin (Blochinger & 
Diggelmann. Mol Cell Biol 4: 2929-2931). and the dhfr gene, which confers resistance to 
methatrexate (Bourouis et al.. EMBO J. 2£7): 1099-1 104 (1983)). theEPSPS gene, which 
confers resistance to glyphosate (U.S. Patent Nos. 4.940.935 and 5.188.642), and the 
mannose-6-phosphate isomerase gene, which provides the ability to metabolize mannose (U S 
Patent Nos. 5,767,378 and 5,994,629). 



25 



1. Vectors Suitable fox Agrobacterium Transformation 

Many vectors are available for transformation u^rx^gAgrobactenum tumefaciens. These 
typicaUy cany at least one T-DNA border sequence and include vectors such as pBINl9 
(Bevan. Nucl. Acids Res. (1984)) and pXYZ. Below, the construction of two typical vectors 
suitable for Agrobacterium transformation is described, 
a. pCIB200 and pCro2001: 
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The binary vectors pcIB200 and pCIB2001 are used for the construction of recombinant 
Vectors for use v/iih Agrobacterium and are constructed in the following manner. pTJS75kan is 
created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. Bacteriol. 164 : 446-455 
(1985)) aUowing excision of the tetracycline-resistance gene, foDowed by insertion of an AccI 
5 fragment from pUC4K carrying an NPTII (Messing & Vierra, Gene 12: 259-268 (1982): 
Bevan et al., Nature 304: 184-187 (1983): McBride et al., Plant Molecular Biology 14: 266- 
276 (1990)). Xhol linkers are ligated to the EcoRV fragment of PCIB7 which contains the left 
and right T-DNA borders, a plant selectable nos/nptll chimeric gene and the pUC polylinker 
(Rothstein et al., Gene 53: 153-161 (1987)), and the J^/z^^Z-digested fragment are cloned into 
10 5a//-digested pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19). pCIB200 
contains the following unique polylinker restriction sites: EcoRI, SstI, Kpnl, Bglll, Xbal, and 
SalL pCIB2001 is a derivative of pCIB200 created by the insertion into the polylinker of 
additional restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI, 
Sstl KpfU, Bglll Xbal, Sail, Mlul, Bell Avrll, Apal, Hpal, and Stul. pCIB2001, in addition 
15 to containing these unique restriction sites also has plant and bacteria] kanamycin selection, left 
and right T-DNA borders for Agrofcacrenum-mediated transformation, the RK2-derived tjfA 
function for mobilization between E. coli and other hosts, and the OnTand On V functions 
also from RK2. The pC3B2001 polylinker is suitable for the cloning of plant expression 
cassettes containing their own regulatory signals. 
20 b. pCIBlO and Hygromycin Selection Derivatives thereof: 

The binary vector pCEB 10 contains a gene encoding kanamycin resistance for selection in 
plants and T-DNA right and left border sequences and incorporates sequences from the wide 
host-range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacteriunu Its 
construction is described by Rothstein et al (Gene 53: 153-161 (1987)). Various derivatives of 
25 pCSBlO are constructed which incorporate the gene for hygromycin B phosphotransferase 
described by Gritz et al (Gene 25: 179-188 (1983)). These derivatives enable selection of 
transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin 
(pCIB715,pCIB717). 

2. Vectors Suitable for non-Agrobacterium Transformation 
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Transformation without the use of AgroZ^ac/enttm tumefaciens circumvents the requirement 
for T-DNA sequences in the chosen transformation vector and consequently vectors lacking 
these sequences can be utilized in addition to vectors such as the ones described above which 
contain T-DNA sequences. Transformation techniques that do not rely on Agrobacterium 
include transformation via particle bombardment, protoplast uptake {e.g. PEG and 
electroporation) and microinjection. The choice of vector depends largely on the preferred 
selection for the species being transformed. Below, the construction of typical vectors suitable 
for non-Agrobacterium transformation is described, 
a. pCIB3064: 

pCffi3064 is a pUC-derived vector suitable for direct gene transfer techniques in combination 
with selection by the herbicide basta (or phosphinothricin). The plasmid paB246 comprises 
the CaMV 35S promoter in operational fusion to the E. coli GUS gene and the CaMV 35S 
transcriptional terminator and is described in the PCT published application WO 93/07278. 
The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites 
are mutated using standard PGR techniques in such a way as to remove the ATGs and generate 
the restriction sites Sspl and PvuII. The new restriction sites are 96 and 37 bp away from the 
unique SaU site and 101 and 42 bp away from the actual start site. The resultant derivative of 
pCro246 is designated paB3025. The GUS gene is then excised from pCIB3025 by digestion 
with Sail and Sad, the termini rendered blunt and religated to generate plasmid pCro3060. 
The plasmid pjrr82 is obtained from the John Innes Centre, Norwich and the a 400 bp Smal 
fragment containing the bar gene from Streptomyces viridochromogenes is excised and 
inserted into the Hpal site of pCffi3060 (Thompson et al. EMBO J ^: 2519-2523 (1987)). 
This generated pCIB3064, which comprises the bar gene under the control of the CaMV 35S 
promoter and terminator for herbicide selection, a gene for ampicillin resistance (for selection 
in E. coll) and a polylinker with the unique sites SphI, PstI, Hindlll. and BamHI. This vector is 
suitable for the cloning of plant expression cassettes containing their own regulatory signals. 

b. pSOG19 and pSOG35: 

PSOG35 is a transformation vector that utilizes the E coli gene dihydrofolate reductase (DFR) 
as a selectable marker conferring resistance to methotrexate. PGR is used to amplify the 35S 
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promoter (-800 bp), intron 6 from the maize Adhl gene (-550 bp) and 18 bp of the GUS 
untranslated leader sequence from pSOGlO. A 250-bp fragment encoding the E, coli 
dihydrofolate reductase type 11 gene is also amplified by PGR and these two PGR fragments 
are assembled with a Sacl-Pstl fragment from pB1221 (Glontech) which comprises the pUG19 

5 vector backbone and the nopaline synthase terminator. Assembly of these fragments generates 
pSOG19 which contains the 35S promoter in fusion with the intron 6 sequence, the GUS 
leader, the DHFR gene and the nopaline synthase terminator. Replacennent of the GUS leader 
in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generates 
the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and 

10 have Hindlll, SphI, PstI and EcoRI sites available for the cloning of foreign substances. 
3. Vector Suitable for Chloroplast Transformation 

For expression of a nucleotide sequence of the present invention in plant plastids, plastid 
transformation vector pPH143 (WO 97/3201 1, example 36) is used. The nucleotide sequence 
is inserted into pPH143 thereby replacing the PROTOX coding sequence. This vector is then 
15 used for plastid transformation and selection of transformants for spectinomycin resistance. 
Alternatively, the nucleotide sequence is inserted in pPH143 so that it replaces the aadH gene. 
In this case, transformants are selected for resistance to PROTOX inhibitors. 

Transformation 

20 Once a nucleic acid sequence of the invention has been cloned into an expression system, it is 
transformed into a plant cell. Methods for transformation and regeneration of plants are well 
known in the art. For example, Ti plasmid vectors have been utilized for the delivery of foreign 
DNA, as weD as direct DNA uptake, liposomes, electroporation, micro-injection, and 
microprojectiles. In addition, bacteria from the genus Agrobacterium can be utilized to 

25 transform plant cells. Below are descriptions of representative techniques for transforming both 
dicotyledonous and monocotyledonous plants, as well as a representative plastid 
transformation technique. 
1. Transformation of Dicotyledons 

Transformation techniques for dicotyledons are well known in the art and include 
30 AgrobacteriumAyz&^d techniques and techniques that do not require Agrobacterium, Non- 
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Agrobactenum techniques involve the uptake of exogenous genetic material directly by 
protoplasts or ceUs. This can be accompUshed by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques are 
described by Paszkowski et al, EMBO J 3: 2717-2722 (1984), Potrykus et al, Mol. Gen. 
Genet. 199: 169-177 (1985), Reich etal. Biotechnology 4: 1001-1004 (1986), and Klein et 
al. Nature 322: 70-73 (1987). In each case the transfonned cells are regenerated to whole 
plants using standard techniques known in the art. 

Agrohactenum-v^ysAsA transformation is a preferred technique for transformation of 
dicotyledons because of its high efficiency of transformation and its broad utility with many 
different species. AgrohacteHxmi transformation typically involves the transfer of the binary 
vector carrying the foreign DNA of interest {e.g. pCIB200 or pCIB2001) to an appropriate 
Agrobacterium strain which may depend of the complement of Wr genes carried by the host 
Agrobacteriwn strain either on a co-resident Ti plasmid or chromosomally ie.g. strain CIB542 
forpCIB200andpCIB2001 (Uknes er a/. Plant Cell 5: 159-169 (1993)). ITie transfer of the 
recombinant binary vector to Agrohacterium is accomplished by a triparental mating procedure 
using E. coli carrying the recombinant binary vector, a helper E. coli strain which carries a 
plasmid such as pRK2013 and which is able to mobilize the recombinant binary vector to the 
target Agrobacterium strain. Alternatively, the recombinant binao' vector can be transferred to 
Agrobacterium by DNA transformation (Hofgen & WiUmitzer. Nucl. Acids Res. 16- 9877 
(1988)). 

Transformation of the target plant species by recombinant Agrobacterium usually involves co- 
cultivation of the Agrobacterium with explants from the plant and foDows protocols well 
known in the art. Transfonned tissue is regenerated on selectable medhim carrying the 
antibiotic or herbicide resistance marker present between the binary plasmid T-DNA borders. 
Another approach to transforming plant cells with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U S Patent 
Nos. 4.945.050, 5,036.006. and 5,100.792 all to Sanford et al. Generally, this procedure 
involves propelling inert or biologicaUy active particles at the ceDs under conditions effective to 
penetrate the outer surface of the cell and afford incorporation within the interior ther^f. 
When inert particles are utilized, the vector can be introduced into the cell by coating the 
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particles with the vector containing the desired gene. Alternatively, the target cell can be 
surrounded by the vector so that the vector is carried into the cell by the wake of the particle. 
Biologically active particles (e.g., dried yeast cells, dried bacterium or a bacteriophage, each 
containing DNA sought to be introduced) can also be propelled into plant cell tissue. 
5 2. Transformation of Monocotyledons 

Transformation of most monocotyledon species has now also become routine. Preferred 
techniques include direct gene transfer into protoplasts using PEG or electroporation 
techniques, and particle bombardment into callus tissue. Transformations can be undertaken 
with a single DNA species or multiple DNA species (i.e. co-transformation) and both these 
10 techniques are suitable for use with this invention. Co-transformation may have the advantage 
of avoiding complete vector construction and of generating transgenic plants with unlinked loci 
for the gene of interest and the selectable marker, enabling the removal of the selectable 
marker in subsequent generations, should this be regarded desirable. However, a disadvantage 
of the use of co-transformation is the less than 100% fr^uency with which separate DNA 
15 species are integrated into the genome (Schocher et al Biotechnology 4: 1093-1096 (1986)). 
Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe techniques for 
the preparation of caDus and protoplasts from an elite inbred line of maize, transformation of 
protoplasts using PEG or electroporation, and the regeneration of maize plants from 
transformed protoplasts. Gordon-Kamm et al (Plant Cell 2: 603-618 (1990)) and Fromm et aL 
20 (Biotechnology 8: 833-839 (1990)) have published techniques for transformation of Al 88- 
derived maize line using particle bombardn:ient- Furthermore, WO 93/07278 and Koziel et ah 
(Biotechnology 11: 194-200 (1993)) describe techniques for the transformation of elite inbred 
lines of maize by particle bombardment. This technique utilizes immature maize embryos of 
1.5-2.5 mm length excised from a maize ear 14-15 days after pollination and a PDS-lOOOHe 
25 Biolistics device for bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques utilizing 
protoplasts or particle bombardment. Protoplast-mediated transformation has been described 
for Japonica-iypcs and /nd/ca-types (Zhang al Plant Cell Rep 7: 379-384 (1988); 
Shimamoto et al Nature 338: 274-277 (1989); Datta et al. Biotechnology 736-740 (1990)). 
30 Both types are also routinely transformable using particle bombardment (Christou et al 
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Biotechnology 2: 957-962 (1991)). Furthermore, WO 93/21335 describes techniques for the 
transfonnation of rice via electroporation. 

Pa.™. Applicadon BP 0 332 581 descnbes .echmques for ,he g^neratio,. ttanrfbn»ado„ and 
regenerate,, of Pooideae prcoplas... These techniques allow the bansfonnaaon <.tDcctylis 
and wheat. Ftmhennore, wheat transformation has been described by Vasil « al 
(Biotechnology ifi: 667-674 (1992)) using panicle b„ntod„«,t into cells of type C Ion-term 
regenerable callus, and also by Vasil « oL (Biotechnology U: 1553-1558 (1993)) ami W«b 
etal (Plant Physiol m- 1077- 1084(1 993)) n^gpartcle bombardment of inmature 
embtyos and imma«re emtayo-derived callus. A p,ef«Ted technique for wheat transfonnation 
however, involves the transfonnadon of wheat by particle bombardment of immature embrvos' 
and mctades eifter a high sucrose or a high maltose step prior to gene delivety. Prior to 
bombardment, any number of embryos (0.75-1 mmin lengU.) are plated onto MS medium v»ith 
3% sucrose (Murashiga & Skoog. Physiologia PIan.an3m J5: 473-497 (1962)) and 3 mg« 2 4- 
D for n«Juction of somatic embryos, which is allowed to p^ceed in a,e dark. On the chosen 
day of bombardment, embryos are n=moved ftom the inducUon medium a«i placed onto the 
osmoucum iU. induction medium wia, s»=rose or malt^e added a. a,e desired concentration 
.yptcally .5%). ■n,e embtyos a« Howed to plasmolyze for 2-3 h and ate then bombarded ' 
Twe.^ embryos per target pb« is .ypica,. aUaough no, critical. An appropriate gene-catrying 
plasm,d (such as pCIB3(W4 or pS035) is precipi.a,ed onto micrometer size gold particles 
usmg sundard procedutes. Each plate of emhyos is shot wiu, the DuPon. BioBstics® helium 
devce using a bma pressure of -1000 psi using a standard 80 mesh screen After 
bombardment, the emho^os are placed back into the dark to recover for about 24 h (stiD on 
o^Ucum). After 24 hrs, the embryos arere„,>ved from U« osmoticum and placed back omo 
mduction medium where ti,ey stay for abou. an^nti, l«f„te regeneration. Approximately one 
month later the embryo explants with developing embryogenic calhs are transferred to 
regeneration medium (MS * 1 mg/Bter NAA. 5 mgffi.er GA), fteher comaining the 
appropriate selection agen, (10 mg/1 basta in the case of paB3064 and 2 mg. methotrexate in 
U,e case of pSOG35), After approximately one month, developed shoots a. u^sfetred to 

Urgersterifccomamersknown as "GA7s-whichcon.ain half-strength MS. 2% sucrose, and 
the same concentration of selection agent. 
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Tranformation of monocotyledons using Agrobacterium has also been described. See, WO 
94/00977 and U.S. Patent No. 5,591,616. . 
3. Transformation of Plsistids 

In another preferred einbodiment, a nucleotide sequence of the present invention is directly 
5 transformed into the plastid genome. A major advantage of plastid transformation is that 
plastids are generally capable of expressing bacterial genes without substantial modification, 
and plastids are capable of expressing multiple open reading frames under control of a single 
promoter. Plastid transformation technology is extensively described in U.S. Patent Nos. 
5,451,513, 5,545,817, and 5,545,818, in PCX application no. WO 95/16783, and in McBride et 
10 al (1994) Proc. Natl Acad. Set USA 91, 7301-7305. The basic technique for chloroplast 
transformation involves introducing regions of cloned plastid DNA flanking a selectable 
marker together with the gene of interest into a suitable target tissue, e.g., using biolistics or 
protoplast transformation (e.g., calcium chloride or PEG mediated transformation). The 1 to 
1 .5 kb flanking regions, termed targeting sequences, facilitate homologous recombination with 
15 the plastid genome and thus allow the replacement or modification of specific regions of the 
plastome. Initially, point mutations in the chloroplast 16S rRNA and rpsl2 genes conferring 
resistance to spectinomycin and/or streptomycin are utilized as selectable markers for 
transformation (Svab, Z., Hajdukiewicz, P., and Maliga, P. (1990) Proc. Natl. Acad. Sci. USA 
87, 8526-8530; Staub, J. M., and Maliga, P. (1992) Plant Cell 4, 39-45). This resulted in stable 
20 homoplasmic transformants at a frequency of approximately one per 100 bombardments of 

target leaves. The presence of cloning sites between these markers allowed creation of a plastid 
targeting vector for introduction of foreign genes (Staub, J.M., and Maliga, P. (1993) EMBO 
J[ 12, 601-606). Substantial increases in transformation frequency are obtained by replacement 
of the recessive rRNA or r-protein antibiotic resistance genes with a dominant selectable 
25 marker, the bacterial aadA gene encoding the spectinomycin-detoxifying enzyme 

aminoglycoside-3'-adenyltransferase (Svab, Z., and Maliga, P. (1993) Proc. Natl Acad, ScL 
USA 90, 913-917). Previously, this marker had been used successfully for high-frequency 
transformation of the plastid genome of the green alga Chlamydomonas reinhardtii 
(Goldschmidt-Qermont, M. (1991) Nucl Acids Res. 19: 4083-4089). Other selectable markers 
30 useful for plastid transformation are known in the art and encompassed within the scope of the 
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are 



invention. Typically, approximately 15-20 cell division cycles following transformation 
required to reach a homoplastidic state. Plastid expression, in which genes are inserted by 
homologous recombination into all of the several thousand copies of the circular plastid 
genome present in each plant cell, takes advantage of the enormous copy number advantage 
over nuclear-expressed genes to permit expression levels that can readily exceed 10% of the 
total soluble plant protein. In a preferred embodiment, a nucleotide sequence of the present 
invention is inserted into a plastid targeting vector and transformed into the plastid genome of 
a desired plant host. Plants homoplastic for plastid genomes containing a nucleotide sequence 
of the present invention are obtained, and are preferentiaDy capable of high expression of the 
nucleotide sequence. 



EXAMPLRS 

The invention wiU be fimher described by reference to the following detailed examples These 
examples are provided for purposes of illustration only, and are not intended to be limiting 
unless otherwise specified. Standard recombinant DNA and molecular cloning techniques used 
here are weD known in the art and are described by Ausubel (ed.). Current Protocols in 
Molecular Biology. John Wiley and Sons, Inc. (1994): T. Maniatis, E. F. Fritsch and J 
Sambrook, Molecular Qoning: A Laboratory Manual. Cold Spring Harbor laboratory Cold 
Spring Harbor. NY ( 1 989); and by T.J. Silhavy, M.L. Berman. and L.W. Enquist, Experiments 
with Gene Fusions, Cold Spring Harbor Laboratory. Cold Spring Harbor. NY (1984). 

EXAMPTP.I : Transfonnation of maize with heat-stable thioredoxin 

A gene expressing the heat-stable thioredoxin from Methanococcus Jannaschii, having the 
sequence shown in SEQ ID NO: 1 is prepared using maize prefeired codons as described in US 
patent 5,625,136, under control of the seed-specific gamma-zein promotor. and the expression 
cassette incorporated between the T-DNA boundried of the pGIGUP plasmid. 
Strain Asrobacterium tumefaciens I^A4404 (pAL4404. pSB 1) is used in these experiments 
PAL4404 is a disarmed helper plasmid. pSB 1 is a wide host i^ge plasmid that contains a 
region ofhomology to PGIGUP andal5.2kbWfragment from the virulence region of 
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pTiBo542 (Ishida et al., 1996; High efficiency transformation of maize (Zea mays L.) mediated. 
by Agrobacterium tumefaciens. Nature Biotechnology 14, 745-750). The introduction of the 
. plasmid pGIGUP by electroporation into LBA4404(pAL4404, pSBl) results in a cointegration 
of pGIGUP and pSB 1 . The T-DNA of this plasmid contains a mannose-6-phosphate isomerase 
5 gene driven by the ubiquitin promoter to provide the ability to metabolize mannose, as well as 
the thioredoxin gene described above. 

Agrobacterium is grown for 3 days on YP medium (5g/l yeast extract, lOg/1 peptone, 5g/l 
NaCI, 15 g/1 agar, pH 6.8) supplemented with 50 mg/I spectinomycin and 10 mg/1 tetracycline. 
Bacteria are collected with a loop and suspended in N6 liquid medium at a density ranging 
10 from 10^ to 5 lO^cells/ml. Agrobacterium cells can also be collected from an overnight culture 
in YP medium and resuspended in N6 liquid medium. 

Maize immature embryos are obtained approximately 10 to 14 days after self-pollination. The 
immature zygotic embryos are divided among different plates containing medium capable of 
inducing and supporting embryogenic callus formation at about 25 immature embryos per 
15 plate. 

The immature embryos are inoculated either on the plate or in liquid with Agrobacterium 
having a Ti plasmid comprising a selectable marker gene. The immature embryos are plated on 
callus initiation medium containing silver nitrate (10 mg/1) either prior or immediately after 
inoculation with Agrobacterium. Approximately 25 immature embryos are placed onto each 

20 plate. 16 to 72 hours after inoculation, immature embryos are transferred to callus initiation 
medium with silver nitrate and cefotaxim. Selection of transformed cells is carried out as 
follows: Mannose is used to select transformed cells in vitro. This selection can be applied as 
low as 1 g/L 2 to 20 days after inoculation and maintained for a total of 2-12 weeks. The 
embryogenic callus so obtained is regenerated in the presence or absence of mannose on 

25 standard medium of regeneration. All plants are tested by the chlorophenol red (CR) test for 
tolerance to mannose. This assay utilizes a pH sensitive indicator dye to show which cells are 
growing in the presence of mannose. Cells that grow produce a pH change in the media and 
turn the indicator Chlorophenol Red yellow from red. Plants expressing the tolerance to 
mannose are easily identified in this test. Plants positive by the CR test are assayed by PCR for 
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the presence of the mannose gene. Plants which, are positive for PGR test are analyzed by 
Southern blot. 

The regenerated plants are assayed for expression of the thioredoxin. The plants are 
developmentally normal. Com grain from progeny plants derived from the highest expressing 
event is assayed in a smaD scale wet milling process and starch extractability is measured 
compared to com of the same genotype without the thioredoxin transgene. Com expressing 
the thioredoxin gene exhibits substantially greater starch availability in the wet milling process 
than the isogenic non-transformed com 

EXAMPLE 2 : Transfonnation of maize with heat-stable thioredoxin and thioredoxin 

reductase 

Using the procedures described in Example I. maize is co-transformed with genes encoding 
both thioredoxin (SEQ ID NO: 1) and thioredoxin reductase (SEQ ID NO:6) from 
Methanococcusjannaschii. Both genes are under control of the seed specific gamma zein 
promoter. The two genes are linked and placed between the right and left borders of the 
pGIGUP plasmid to enhance the likelihood that both genes wiD be incorporated into the 
chromosome of the plant as a single insert. 

The regenerated plants are assayed for expression of the thioredoxin and thioredoxin 
reductase. The plants are developmentally normal. Com grain from progeny plants derived 
from the highest expressing event is assayed in a small scale wet milling process and starch 
extractability is measured compared to com of the same genotype without the 
thioredoxin/thioredoxin reductase transgenes. Com expressing the thioredoxin and thioredoxin 
reductase genes exhibits substantially greater starch avaflability in the wet milling process than 
the isogenic non-transformed corn 

EXA^m^: Cloning of thioredoxin genes and construction of plant transfonnation 

vectors 

Rice and wheat thioredoxin-h cDNA itrx-h) are cloned by RT-PCR using total RNA from rice 
and wheat germinated seeds, respectively. Ainplification of trx cDNA is obtained by using 
primers NMD109 -(5 '-GGA Tcc acc atg gcc gcc gag gag-3' (SEQ ID N0:8)) and 
NMD110(5'-C3AG CTC TTA GGC AGA AGC AGA TG-3 ' (SEQ ID NO:9)) for rice and 
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NMD102 ( 5 ' -GGA TCC Acc ATG GCG GCG TCG G-3 ' (SEQ ID NO: 10)) and NMD 103 
(5 '-GAG CTC TTA CTG GGC CGC GTG T-B ' (SEQ ID NO: 11)) for wheat. Insertion of 
appropriate restriction sites required for cloning the gene into plant expression vectors vizi, 
BaniHI at the 5' end and Sad at the 3* end is also achieved with this reaction. PGR products 

5 of the correct size are gel purified and cloned using the Topo PGR 2.1 cloning vector 
(Invitrogen). Colonies containing the correct insert are sequenced following restriction 
analysis. The rice rrx sequence matches that published in Genbank Accession no. U92541. The 
wheat cDNA sequence matches trx-h from T aestivium (Genbank Accession No. X69915) 
Cloning of the y zein promoter: 673 bp y zein promoter is amplified from plasmid pGZ273 

10 obtained from Dr. Brian Larkins. This sequence also exactly matches the opaque2 modifier 5' 
region (Genbank accession no. S78780) as well as (Marzabal et al. 1998. Plant J. 16:41-52). 
The y zein promoter has been shown to be endosperm specific (Torrent et al. (1997) Plant 
Mol. Biol. 34: 139-149). 

pNOV 3401 : Maize ubiquitin promoter plus intron- rice trx-h- 35S terminator in an 

15 Agrobacterium transformatioii vector with PMI selection: 

Rice trx gene is cloned into a plant expression vector containing the maize ubiquitin promoter 
plus intron and a 35S terminator. The resulting construct pNOV 3400 is digested with 
restriction endonucleases Hindlll and Kpnl to subclone into an Agrobacteriwn transformation 
binary vector cassette pNOV 21 17 to obtain pNOV 3401 . 

20 pNOV 3405: y zein promoter- rice trx-h- 35S terminator in an Agrobacterium transformation 
vector with PMI selection: 

pNOV 3406: y zein promoter- wheat trx-h" 35S terminator in an Agrobacteriwn 
transformation vector with PMI selection: 

Both rice and wheat trx-h genes are cloned into a plant expression vector containing the y zein 
25 promoter described above and a 35S terminator. The resulting constructs are digested with 
Hindlll, and Kpnl to obtain the promoter, gene, terminator units and subcloned into an 
Agrobacterium binary vector pNOV 21 17 resulting in pNOV 3405 and pNOV3406 
respectively. pNOV 21 17 is a binary vector with the gene for phosphomannose isomerase 
(PMI) driven by a maize ubiquitin promoter plus intron and a NOS terminator. 
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pNOV: 3408: y zein promoter- Tzein signal sequence- Rice r/x-Zi- yzein 3' end - 35S 
temiinator in an Agrobacterium transformation vector with PMI selection: 
To target rice thioredoxin to the endomerabrane system of the cell, signal sequences from the 
N-terminus and C-tenninus of the y- zein gene are used (Torrent et al. (1994) Planta 192: 512- 
518). Restriction sites Eco47AIII is inserted at the 5' end of the rice thioredoxin gene after the 
first ATG and restriction site Nhel is inserted at the 3' end by PGR mutagenesis using 
mutagenic primers NMD124A (5 '-GAGCTCTTAG gcgctagcag atg-3' (SEQID 
NO: 12)) and NMD125A (5 ' -ggatccacca GCGCTgccga-3 - (SEQ ID NO: 13)). A]] 
mutations are silent. The gene is cloned into a topo PCR2. 1 vector and sequenced. The tnc 
fragment is obtained by digestion with restriction enzymes Eca47III and Nhel. Four 
oligonucleotides are made to encode the yzein signal sequence and the C- terminus: NMD126 

(5 ' -GATCCACCAT GAGGGTGTTG CTCGTTGCCC TCGCTCTCCT GGCTCTCGCT 
GCGAGCGCCA CCAGC-3 ' (SEQ ID NO:14)); NMD127 (5 ' -GCTGGTGGCG CTCGCAGCGA 
GAGCCAGGAG AGCGAGGGCA ACGAGCAACA CCCTCATGGT G-3 ' (SEQ ID NO: 15)); 
NMD128(5'-CTAGCGCTCT GCAGCAGCCG ACTCCATGCC CCTACGCTGC TGCCGGCGGT 
GTCCCCCACT GAGAGCT-3 ' (SEQ ID NO: 16)); and NMD129 (5 ' -CTCAGTGGGG 
GACACCGCCG GCAGCAGCGT AGGGGCATGG AGTCGGCTGC TGCAGAGCG-3 ' (SEQID 
NO:17)). OUgo pairs NMD 126 and 127 and NMD 128 and 129 are hybridized and 
phosphorylated using T4 polynucleotide kinase foUowing standard protocols. These two 
hybridized, kinased oligo pairs are ligated in a four-way ligation reaction with Eco47III, Miel 
digested trx described above and a plant expression vector cassette containing the y zein 
promoter and 35S terminator. The resulting construct is digested with Hindin, and Kpnl to 
obtain the promoter, gene, terminator unit and subcloned into an Agrobacterium binary vector 
pNOV 21 17 containing the selectable marker phosphomannose isomerase (PMI) gene driven 
by a maize ubiquitin promoter, resulting in pNOV 3408. 

pNOV 3401. PNOV3405, pNOV3406 and pNOV3408 are transformed into Agrobacterium 
strain LBA4404 (pSBl) and used for stable maize transformation. 

Arabidopsis thioredoxin reductase is found to be active in reducing rice thioredoxin in vitro. 
Therefore a maize optimized NTR gene is constructed. 
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EXAMPT P. 4 : Construction of a maize optimized Arabidopsis N ADPH dependent 
thioredoxin reductase gene 

Thi^ Arabidopsis NADPH dependent thioredoxin reductase gene (NTR) is a 35kD 

protein. To design the synthetic gene, the deduced peptide sequence of the NTR gene 

5 (Genbank Accession # Z23109) is backtranslated using the 'TBacktranslation" program 

found in the University of Wisconsin GCG group of programs using a maize preference 

codon table (Murray et al. (1989) Nucl. Acids Res. 17: 477-498). The "maize 

optimized" sequence is further modified to insert unique sites to facilitate cloning. The 

gene is designed to be cloned in three parts. Each fragment is constructed by 

10 hybridization of 8-10 pairs of oligomers 60-75 nucleotides in length representing both 

strands of the gene. A 15 nucleotide overlap is designed between sequential oligo pairs 

for correct orientation and assembly. Oligos are synthesized by Genosys Inc., (Texas). 

Fragment 1 of the gene (corresponding to nucleotides 1-305) is constructed by 

. amplifying the.305 bp fragment by PGR using Taq polymerase and the standard 

15 conditions recommended by the supplier, an equimolar mixture of 8 oligomers as 

template and primers STRFl A (5 ' -ggATCCACCA TgAACggCCT ggAg-3' (SEQID 

' NO:18)) andSTRFlB (5'-CTCgAgAAgT CCACCTTggT CAC-3' (SEQID 

NO: 19)). The second fragment of the gene is constructed by amplifying a 346 bp 

fragment (nucleotides 299-645) by PGR using an equimolar mixture of 10 oligomers as 

20 template and primers STRF2A (5 • -CTCgAgCAAg ccgTTCAA-3 • (SEQ ID NO:20)) 

and STRF2B ( 5 • -gACgTCgATC TTCgggTTgg A-3' (SEQ ID NO:21)). The third 

fragment of the gene is constructed by amplifying a 382 bp fragment (nucleotides 639- 

1021) by PGR using an equimolar mixture of 10 oligomers as template and primers 

STR3A (5 • -CgACgTCATC TggAACTCCT-3 ' (SEQ ID NO:22)) and STR3B (5 ' - 

25 gAgCTCAgAT CTAgTCggAC TTg-3 ' (SEQ ID NO:23)). The amplified DN A for 

each fragment is cloned into a topo PCR2.1 T- vector (Invitrogen). Gene fragments 

with the correct sequence are joined using the overlapping restriction endonuclease site 

Xhol and AatIL The maize opSmze^d Arabidopsis NADPH dependent thioredoxin 

reductase coding sequence is shown as SEQ ID NO:24 and the encoded amino acid 

30 sequence is shown as SEQ ID NO:25. The complete gene is constracted and sequenced 
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and subcloned into plant expression vector cassette containing the yzein promoter and 
35S terminator. The promoter, gene, terminator unit is then subcloned into an 
Agrobacterium maize transformation vector alone and in conjunction with the rice and 
wheat thioredoxin genes. 



10 



15 



^^W^LES : Rice NADPH dependent thioredoxin reductase (NTR) gene 
The rice NADPH dependent thioredoxin reductase (NTR) coding sequence is shown as 
SEQ ID NO: 26 and the corresponding amino acid sequence is shown as SEQ ID 
NO:27. 

EXAMPLE 6 : AUgnment of Arabidopsis NTR and rice sequence described above 

Aligned Sequences 

Reference molecule: arab trPG (SEQ ID NO:25), 1-1002 (334 aa) 
Sequence 2: TRCONAA.TXT (SEQ ID NO:27), 1-310 (310 aa) 
Homology 70% 



Alignment type: Global Protein 

Parameters: Mismatch 2; Open Gap 4; Extend Gap 1 ; Conserv N 



20 



25 



30 



35 



arab trPG 
TRCCSNAA.IXT 

arab trPG 
TRCX2NAA.TXT 

arab trPG 
TRCOaJOAA.TXr 

arab trPG 
TRCXa^AA.TXT 

arab trPG 
TRCXDNAA.TXT 

arab trPG 
TRC09MAA.TXT 

arab trPG 
TRCONAA.TXT 



(1) '^^3j^~^''^''^y^^sp^ 



(145) 
(51) 



gglttttdvea&5gft)egilgveltc33cfekgserfgtti 



.g. .m.rc.a. .1. 



.S.JLS. 



(295) skpfklftdskailadavilaigavakwlsfvgsgevlgglvmrgisaca 
(101) ar. .rvas. .ttv w. t. . . .rr.h.a. . day. ..... . 

(445) vcdgaapifrnkplavigggdsaiteeanflt]Q^skvyiidrxd^^ 

* ' ^ s h....li. .nt 

(595) iinggi:alsi5>Mdvivaasswe^c3gerdvlggl3wknv^ 

* ' •••^ q.f.d.e gegggp.a.v 1. ..ki...q. 

(745) sglffai^epatkfldggveldsdgyvvO^tgtsvpgvfaagdvqd 



(247) 

(895) 
(297) 



.s.h. 

gi 



-53- 



BNSOOCID: <WO_0198509A2J_> 



wo 01/98509 




PCT/EPOl/06918 



EXAMPLE 7 : Plant transformation vectors 

PNOV4100 - PTX5'-At PPO - 35ST' and Ubq3 (At)-intron-NOS vector: 
PBH28 {Arabidopsis UbqSint - NOS) is digested with EcoRI, isolate 4756 bp band, fiU-in with 
5 Klenow, ligate to pCTK2 (PTX5* - AtPPO-35ST) digest with Hindlll- isolate 2386 bp band, 
fill-in with Klenow. pNOV 4100 contains PTX5', AtPPo,35ST*,amp,Ubq3(At) intron NOS. 
Junctions are sequenced. 

PNOV4101 - P conglycinin a* subunit promoter- soybean thioredoxin- NOS in PPO vector 
pNOV4100: 

10 Digest pNOV4100 with Hindlll and SacL Soybean p-conglycinin a' subunit promoter 

(Genbank accession # Ml 3759) is cloned by PGR using soybean leaf genomic DNA and oligos 
P9 (5'-gac taa get tac aat tat tat ate aaa atg gc-3* (SEQ ID NO:28)) and PIO (5'-gct ttt ccc aat 
acg caa tgc-3* (SEQ ID NO:29)) (Sylvain et al. (1992) Plant Mol. Biol. 19:937-949). This 
PGR product is cloned into pGR 2.1 TOPO vector and sequenced. This construct is used as 

15 template in PGR with oligos P4 (5'-gac tag cgc tga cag aaa ctg atg eta gga a-3' (SEQ ID 

NO:30)) and P9 (5'-gac taa get tac aat tat tat ate aaa atg gc-3' (SEQ ID NO:28)). Digest with 
Hindlll and Eco47IIL Soybean thioredoxin is cloned by RT-PCR using total RNA from 
soybean germinated seeds and oligos PI (5'-cgt agg ate cac cat ggc tga aga aga ggg tea ggt tgt 
c-3' (SEQ ID NO:31)) and P2 (5'-cgt aga get etc aag aag aag cag cag cag cag at-3' (SEQ ID 

20 NO:32)). This PGR product is cloned into pGR 2.1 TOPO vector and sequenced. This 

construct is used as the template in PGR with oligos P2 (5*-cgt aga get etc aag aag aag cag cag 
cag cag at-3* (SEQ ID NO:32)) and P5 (5*-gac tag cgc tga aga ggg tea ggt tgt cg-3* (SEQ ID 
NO:33)). Digest with Eco47III and Sad. 3-way ligation with the above three fragments, 
sequence is verified. 

25 PNOV4102 - P conglycinin a' subunit promoter- soybean thioredoxin-tobacco chitinase 
vacuolar signal sequence-NOS in PPO vector pNOV4100. Digest pNOV4100 with Hiiidlll 
Md SacL Soybean - (3 conglycinin a' subunit promoter is cloned by PGR using soybean leaf 
genomic DNA and oligos P9 (5*-gac taa get tac aat tat tat ate aaa atg gc-3' (SEQ ID NO:28)) 
and PIO (5'-gct ttt ccc aat acg caa tgc-3' (SEQ ID NO:29)). This PGR product is cloned into 

30 pGR 2.1 TOPO vector and sequenced. This construct is used as template in PGR with oligos 
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P4 (5'gac tag cgc tga cag aaa ctg atg eta gga a-3' (SEQ ID NO:30)) and P9 (5'-gac taa get tac 
aat tat tat ate aaa atg ge-3' (SEQ ID NO:28)). Digest with Hindlll and Eco47III. Soybean 
thioredoxin (Genbank accession # AI441505) is cloned by RT-PCR using totalRNA from 
soybean genninated seeds and oligos PI (5'-cgt agg ate cac cat ggc tga aga aga ggg tea ggt tgt 
e-3' (SEQ ID NO:31)) and P2 (5--cgt aga get etc aag aag aag cag cag cag cag at-3' (SEQ ID 
NO:32)). This PGR product is cloned into pCR 2.1 TOPO vector and sequenced. This 
construct is used as the template in PGR with oligos P2 (5'-cgt aga got etc aag aag aag cag cag 
cag cag at-3' (SEQ ID NO:32)) and P5 (5'-gac tag cge tga aga ggg tea ggt tgt cg-3' (SEQ ID 
NO:33)). Digest with Eco47III and Sad. 3-way ligation with the above three fragments, 
sequence is verified. 

PNOV4 1 03 - p conglycinin a' subunit promoter plus propeptide portion of p conglycinin - 
soybean thioredoxin- NOS in PPO vector pNOV4100. Soybean p conglycinin a' subunit 
promoter plus propeptide portion of p conglycinin is cloned by PCR using soybean leaf 
genomic DNA and oligos P9 (5'-gac taa get tac aat tat tat ate aaa atg gc-3' (SEQ ID NO:28)) 
and P12 (5'-cag tag get taa gga ggt tgc aac gag-3' (SEQ ID NO:34)). this fragment is cloned 
into pCR 2. 1 TOPO. This construct (12-4^) is digested with StuI and Sad and soybean 
thioredoxin is cloned into this vector. The restriction sites for soybean thioredoxin are modified 
by PCR. Oligos P2 {Sad) (5'-egt aga get etc aag aag aag cag cag cag cag at-3' (SEQ ID 
NO:32)) and Pi UPvuII) (5'-cag tea get gaa gag ggt cag gtt gtc-3' (SEQ ID NO:35)). This 
yields P conglycinin promoter plus propeptide + thioredoxin in pCR 2.1 TOPO caDed A-6. A-6 
and pNOV 4100 are digested with HindlH and Sad. The 1459bp fragment from A-6 is ligated 
to pNOV 4100. 

PNOV4104 - p conglycinin a' subunit promoter plus propeptide portion of P conglycinin - 
soybean thioredoxin-tobaeco chitinase vacuolar signal sequence- NOS in PPO vector 
PNOV4100. Soybean p conglycinin promoter + propeptide in pCR 2.1 TOPO (12-4-4) digest 
with StuI and Sad. PCR fragment generated using PI 1 (5'-cag tea get gaa gag ggt cag gtt gtc- 
y (SEQ ID NO:35)) and P27 (5'-cta gga get eta cat ggt gtc cac cag cag-3' (SEQ ID NO:36)), 
template BTC4 (pBluescript containing soybean thioredoxin and tobacco chitinase vacuolar 
signal sequence). Digest this fragment with PvuIJ and Sad , ligate with StuI -5ac7 fragment. 
This yields A3-10 = pCR 2.1 TOPO with P conglycinin promoter + propeptide-soybean 
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thioredoxin- tob. Chitinase vac. Signal sequence. Digest A3-10 and pNOV4100 with Hindlll 
and Sad and ligate. 

PNOV4105 -XJbq3(At)-intron-tobaGCo chitinase ER signal sequence-NOS and PTX5'-AtPP0- 
35ST'. PNOV 4105 = pNOV4100 digest with BamHI and P^tf ligate to the tobacco chitinase 

5 . ER signal sequence froni pCIB 8418 digest with BamHI and PstL This vector contains the 
Ubq3 promoter and intron with the tobacco chitinase ER signal sequence. 
PNOV4106 - Ubq3(At)-intron-tobacco chitinase ER signal sequence-soybean thioredoxin- 
tobacco chitinase vacuolar signal sequence-NOS in PPO vector pNOV4105. PNOV 4105 and 
pNov4102 are digested with Eco47III and SacI . The 387 bp band from pNOV4102 is ligated 

10 to digested pNOV4105. 

PNOV4107 - lJbq3(At)-intron-tobacco chitinase ER signal sequence-soybean thioredoxin- 
NOS in PPO vector pNOV4105. pNOV 4105 and pNOV4101 are digested with Eco47III and 
SacL The 360 bp band from pNOV4101 is ligated to digested pNOV4105. 
PNOV4108 - P conglycinin a' subunit promoter- soybean thioredoxin-tobacco chitinase 

1 5 vacuolar signal sequence-NOS in binary vector pCIB200. Digest pCIB200 with Xbal and fiDQ-in 
with Klenow. Digest pNOV4 101 with Hindlll and Kpnl, make ends blunt with T4 DNA 
polymerase and ligate the 1626 bp band to digested pCIB200. 

PNOV 4109 - p conglycinin a* subunit pronK)ter plus propeptide portion of P conglycinin — 
soybean thioredoxin- NOS in binary vector pCIB200. Digest pCIB200 with Xbal and fill-in 
20 with Klenow. Digest pNOV4103 with HindlU and Kpnl, make ends blunt with T4 DNA 
polymerase and ligate the 1748 bp band to digested pCIB200. 

PNOV4110 - p conglycinin a' subunit promoter- soybean thioredoxin- NOS Digest pCIB200 
with Xhal and fill-in with Klenow. Digest pNOV4102 with Pvull and Kpnl^ make ends blunt 
with T4 DNA polymerase and ligate the 1843 bp band to digested pCIB200. 
25 PNOy41 1 1 - p conglycinin a' subunit promoter plus propeptide portion of P conglycinin - 
soybean thioredoxin-tobacco chitinase vacuolar signal sequence- NOS in binary vector 
pCIB200. Digest pCrB200 with Xhal and fiH-in with Klenow. Digest pNOV4104 with Pvull 
and Kpnl, make ends blunt with T4 DNA polymerase and ligate the 1969 bp band to digested 
pCIB200, 
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PN0V41 12 - Soybean thioredoxin in Rcoli protein expression vector pET29a. Soybean 
thioredoxin is cloned by RT-PCR using total RNA from soybean germinated seeds and oligos 
PI (5'-cgt agg ate cac cat ggc tga aga aga ggg tea ggt tgt c-3' (SEQ ID NO:31)) and P2 (5'- 
cgt aga get etc aag aag aag cag cag cag cag at-3' .(SEQ ID NO:32)). This PGR product is 
5 digested with BainHI and Sad and cloned into pET29a digested with BamHI and Sad. The 
sequence is verified. 

PNOV4113 -Rice thioredoxin in Rcoli protein expression vector pET29a. pNOV34(K) is 
digested with BamHI and Sad, rice thioredoxin (378 bp) is cloned into pET29a which is 
digested with Baj7iHI and Sad. The sequence is verified. 
10 PNOV41 14 -Wheat thioredoxin in Kcoli protein expression vector pET29a. pNOV3406 is 
digested with BamHI and Sad, wheat thioredoxin (387 bp) is cloned into pET29a which is 
digested with BamHI and Sad. The sequence is verified. 

PNOV41 15 -Arabidopsis NADPH thioredoxin reductase in Kcoli protein expression vector 
pET29a. Arabidopsis NADPH thioredoxin reductase (Genbank accession # Z23109) is cloned 

15 by RT-PCR. Total RNA is isolated from Arabidopsis leaf with Trizol (GibcoBRL. 

Gaithersburg,MD) using the manufacturer's protocol One microgram of total RNA is used in 
the Superscript one-step RT-PCR system (GibcoBRL, Gaithersburg, MD) to generate cDNA 
and a PCR product in one step. Primers P28 (5*-gca egg ctt ggt ggt gaa tcc-3' (SEQ ID 
NO:37)) and P29 (S'-ctc att ctg gtc cat caa tgt c-3* (SEQ ID NO:38)) are used in this reaction. 

20 The manufacturer's protocol is foDowed. The resulting PCR product is diluted 1 : 10 and 1 

microliter is used in a nested PCR reaction with primers P26 (5'-gae tgt cga etc aat cae tct tae 
ctt get gag-3' (SEQ ID NO:39)) and P31 (5'-gac tgg ate caa tgg tet cga aae tea caa c-3' (SEQ 
ID NO:40)). The nested PCR product (998 bp) is gel purified, digested with BamHI and SaU 
and cloned into pET29a digested with BamHI and Sail. The sequence is verified. PNOV 4109 

25 - Digest PCIB200 with Xbal and fiD-in with Klenow. Digest pNOV4103 with Hindlll and 
Kpnl, make ends blunt with T4 DNA polymerase and ligate the 1748 bp band to digested 
PCIB200. This construct is used for transient expression analysis in soybean as well as stable 
transformation of Arabidopsis and other dicots in conjuction with thioredoxin. 
pNOV 4101, pNOV 4102. pNOV 4103, pNOV 4104, pNOV 4106, pNOV 4107 are used in 

30 transient expression experiments in soybean cotyledons. 
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Expression of thioredoxin is analyzed by western blot analysis. 
PNOV 4108, PNOV 4109, PNOV 41 10, PNOV 41 1 1 are used in stable Arabidopsis 
transformation experiments. Expression of thioredoxin is analyzed by western blot analysis. 
The effect of thioredoxin on the expression and activity of seed specific proteins is tested. 
5 PNOV 41 12, 41 13 and 41 14 are constructs containing the soybean, rice and wheat trx-h 
genes, respectively, in an Kcoli expression vector pET 29a (Novagen). These constructs are 
used to prepare thioredoxin protein for the production and purification of antibodies as well as 
standards in thioredoxin enzymatic assays. 

10 EXAMPLE 8 : Protein expression and purification 

The following constructs are used for protein expression in E.colh pNOV41 12 (soybean 
thioredoxin fai pET29a), pNOV41 13 (rice thioredoxin in pET29a), pNOV41 14 (wheat 
thioredoxin in pET29a) and pNOV41 15 {Arabidopsis thioredoxin reductase in pET29a). 
E,coli strain BL21(DE3) pLysS is transformed with each construct. A culture containing an 

15 aliquot from the glycerol stock, 50 micrograros/nd kanamycin, 34 mk:rograms/ml 

chloramphenicol in LB media is grown at 3TC until the optical density measured at 600 nm 
reaches 0.6. The cultures are stored at 4°C until the next day. These cultures are spun down 
and the cells resuspended in fresh LB. Large cultures are started using 1ml of the small culture 
per 25 ml large culture. The cells are grown in LB with 50 micrograms/ml kanamycin, 34 

20 micrograms/ml chloramphenicol at 37**C until the optical density at 600 nm reached 0.6. DPTG 
(isopropyl-P-thiogalactopyranoside) is added to a final concentration of 0.4mM to induce the 
expression of protein. The culture is grown for an additional 3 hours at 37*'C. The culture is 
spun at 3000g for 10 minutes and the cells resuspended in BugBuster (Novagen, Madison,WI) 
using an amount equal to 1/25 of the culture volume. 5 units Dnase per ml BugBuster are 

25 added. The cells are placed at -20''C over night. The cells are thawed and incubated with 
rotation at room temperature for 30 minutes. Cell debris is removed by centrifugation at 
14,000g for 20 minutes at 4°C. 

The expressed protein is a fusion protein containing the S-Tag (15 amino acids) and a thrombin 
cleavage site (6 amino acids) at the 5' end. Using the BamHI site as the 5* end cloning site of 
30 the cDNA, an additional 31 amino acids is added to the 5* end of the protein of interest. 

- 58 - 



BNSDOCID: <WO ^0198509A2_L> 



wo 01/98509 _ _ PCT/EPOl/06918 



The fosion protein is purified by affinity chromatography. Protein extract is added to S-protein 
Agarose slurry (Novagen, Madison, WI). The amount of S-protein agarose needed is 
determined for each experiment because the amount of fusion protein expressed varies. The 
yield is 0.5mg purified protein/ml of resin. The manufacturer's protocol is followed. 
To remove the S-Tag, the S-Tag Thrombin Purification kit (Novagen, Madison.WI) is used 
according to the manufacturer's protocol. 

BXAMPUE 9: Production of Antibodies 

Soybean thioredoxin antibody production: Soybean thioredoxin is purified by affinity 
chromatography using S-protein agarose and the S-Tag is removed as described above. A 
contaminating protein is present in the preparation, therefore the protein is run on a 4-20% 
Tris-glycine gel (Novex, San Diego) and the soybean thioredoxin band cut from the gel. The 
gel slice is supplied to Duncroft, Inc. (Lovettsville.VA) for antibody production in goat 
following standard operating procedure CGI "Polyclonal Antibody Production in Rabbits. 
Sheep & Goats". 

Rice Thioredoxin-Specific Antibody Purification: Rice thioredoxin is affinity purified with S- 
protein agarose (Novagen, Madison. WI) according to the manufacturer's protocol. The S-Tag 
is not removed. 

Preparation of Affi-GeMO column: Purified rice thioredoxin (1 mg) is dialyzed against 2L of 
0.1 M NaHCOa pH 8.3 for 5 hrs before coupling to Bio-rad Affi-Gel 10 gel according to the 
manufacturer's instructions. Briefly, approximately 2 ml of AfB-Gel 10 slurry is transferred to a 
glass fritted funnel that is attached to a vacuum, the solvent is removed, and the gel is washed 
twice with ice cold dH^O (at least 3 bed volumes). The moist gel cake is then transferred to the 
tube containing the dialyzed rice thioredoxin and incubated at 4»C overnight on a rotating 
wheel To ensure that all unoccupied active sites are blocked, 0.1 ml of IM ethanolamine HCL 
(pH 7.0) is added to the gel and rotated for 1 hr at 4<>C. The gel is then transferred to a 
column, washed with PBS. pre-eluted with O.IM glycine-HCl pH 2.5 (0.4 ml), and equilibrated 
in PBS. The final cohamn volume is 0,8 ml. When not in use, the column is stored at 4°C in 
PBS containing 0.2% sodium azide. 
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Purification of Rice Thioredoxin-Specific Antibodies: Soybean thioredoxin goat antiserum is 
immunoaffinity purified using an Affi-Gel 10 column of rice thioredoxin. For each run, 2 m] of 
serum is loaded by gravity onto the column. The column is washed with PBS until the A280 
was < 0.015, and then eluted with 0.4 ml O.IM glycine-HCl pH 2.5. Fractions (1 ml) are 
5 collected and neutralized with 50 |il of 0,5M Tris pH 8.5. Fractions with an A280 of 0.05 or 
greater are pooled. 

EXAMPI^ 10 : Thioredoxin Assays 

Insulin reduction assay - (Ame Holmgren, (1979) J.BioLChem. 254: 9627-9632). In this assay 
10 DTT (dithiothreitol) reduces thioredoxin. Reduced thioredoxin then reduces the disulfide 

bonds in insulin causing a white precipitate to be formed. The rate of prcipitation is recorded at 
650nm. Freshly prepared solutions of insulin (Img/ml in O.IM potassium phosphate pH 6.5), 
2mM EDTA (ethylenedianMnetetraacetic acid), and lOOmM DTT are kept on ice. The assay 
mixture is prepared in cuvettes. Each cuvette contains 750 microliters Img/ml insulin, 3.3 
15 microliters DTT, plus water to a final volume of 1ml. The blank contains no thioredoxin, the 
samples contain various amounts of thioredoxin (minimum for assay is 10 micromolar). The 
samples are prepared and incubated for a minimum of 20 minutes at room temperature before 
reading the optical density at 650 nm. 

DTNB[5,5*>dithiobis(2-nitrobenzoic acid)] assay- (Oblong et.al. (1993) Biochemistry 32: 
20 7271-7277). In this assay, thioredoxin reductase and NADPH (nicotinamide adenine 

dinucleotide phosphate) are used to reduce thioredoxin which then reduces DTNB. The change 
in optical density at 412nm is monitored over 4 minutes. Freshly prepared solutions of DTNB 
(lOOraM in DMSO dimethyl sulfoxide), NADPH (20mM in H2O) and buffer lOOmM Tris pH 
8.0, 0.1 mg/ml BSA are needed. The assay mixture is prepared in cuvettes. 10 microliters of 
25 DTNB, 10 microliters NADPH, 5 micrograms of thioredoxin, 2 micrograms of Arabidopsis or 
E.coli thioredoxin reductase and buffer to a final volume of 1 ml are added to the cuvette. As 
soon as the thioredoxin is added, mix by inversion and start measuring the change in optical 
density at 412nm immediately. This is a slow reaction. The Y-axis should be set from 0 to 
0.5A. The blank contains no thioredoxin. 

30 
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EXAMPLE 11 : Agrobacterium-me^ated transformation of maize 
Transformation plasmids and selectable marker: The genes used for transformation are cloned 
into a vector suitable for maize transformation. Vectors used contain the phosphomannose 
isomerase (PMI) gene allowing for selection of transgenic ceDs with mannose. 
Preparation of Agrobacterium tumefaciens: Agrobacterium strain LBA4404 (pSBl) containing 
the plant transformation plasmid is grown on YEPC (yeast extract (5 g/L). peptone (lOg/L), 
NaCl (5g/L), Caa2-2H20 (1.029g/l)) soKd medium with appropriate antftiotics (spectinomylin 
(100 mg/L). tetracycline (10 mg/L)) for 2 - 4 days at 28°C. Approximately 0.75 X \& 
Agrobacterium are suspended in LS modified liquid infection media supplemented with 100 
MM acetosyringone (Negrotto et al, (2000) Plant Cell Rep in press: modified with 0. IX 
phosphate). Bacteria is pre-induced in this medium for 0.5 - 2 hrs before use. Bacteria 
concentration is checked at 660nm and optical density is adjusted to approximately 0.75. 
InoculaUon: Immature embryos from A188, Hi-H or A188 X Hi-H are excised from 8 - 9 day 
old ears directly into a 1.5 ml centrifuge tube containing LS modified liquid infection media 
supplemented with 100 mM acetosyringone. Total excision time is 30 minutes. Embryos are 
vortexed for 5 seconds, allowed to setUe and infection medium is removed. Fresh infection 
medium is added. Embryos are heat shocked for 5 minutes at 45''C by placing the tube in a 
water bath. Infection media is removed and replaced with Agrobacteriimi solution. Embryos 
are vortexed for 30 seconds and allowed to settle with the bacteria for 5 minutes. The 
bacteria/embiyo solution is poured onto solidified LS modified infection media supplemented 
with 500 MM acetosyringone (Negrotto et al. ibid: modified with 0.1 X phosphate). Bacteria 
solution is carefiiUy pipetted ofi^ and cmhryos are moved to a clean section of the plate. 
Embryos, are placed scutellum side up and are co-cultured for 2 - 3 days at 22»C. 
Selection of transformed cells and regeneration of transformed plants: FoUowing co-culture, 
embryos are placed on JMS media (Suttie et al., (1991) 3- International Congress Molecule 
Biology of Plant Growth and Development Poster #905) supplemented with AgNOs and 200 
mg^ ticarcillin for callus initiation. Ticarcillin is used in afl subsequent media. After 10 days of 
culture in the dark at 28°C embryogenic callus has initiated. Callus is transferred to JMS 
medium without silver and with 10 g^ mannose, 5 g/L sucrose for selection. After 2 - 3 
weeks, surviving callus is transferred to fresh selection media. FoDowing 2 - 3 weeks. 
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surviving callus is transferred to MSAK + PO4 medium (Murashige and Skoog (1962) Physiol. 
Plant. 15:473 - 439: supplemented with 20 g/L sucrose, 5 g/L raannose, aiicimidol (0.25 
mg/L), kinetin (0.5 mg/L) and KH2PO4 (170 mg/L)) for regeneration (28°C, dark, 10 - 14 
days). Callus is transferred to fresh MSAK + PO4 media and transferred to the light (1 6hr 
5 light/8hr dark). After 1 week, regenerating shoots are transferred to MS media without 
hormones supplemented with 20 g/L sucrose and 5 g/L mannose; Rooted shoots are 
transferred to Magenta™ GA-7 boxes with 0.75 strength MS media supplemented with 10 
nd/L of Plant Preservative Mixture™ and 10 g/L sucrose for further growth. Analysis is 
performed on plants directly from GA-7 boxes or plants transfenred to soil. 

10 

EXAMPLE 12 : Soybean cotyledon transient expression system 
Sterilized seeds of the S391 1 Novartis breeding line are germinated, 5/plate, on MS sohd 
medium for 6 days under 16/8 photoperiod, 25°C. The cotyledons are explanted and sliced into 
1 -2 mm cubes. The cubes from a pair of cotyledons are arranged in a circle, 1-2 cm in 

15 diameter, in the center of a petri plate containing MS medium with 1 mg/1 BAP and 0.5 mg/L 
NAA. The tissue is bombarded with the PDS-1000 Helium gun according to the DuPont 
manual. Each plate is shot 2 times using 1550 psi rupture disks. The gold microcarriers with 
DNA are prepared according to the manual. 0.6 Mg of selected plasmid DNA are applied to 
each macrocarrier. A stainless steel screen is used to baffle the shock wave. After 

20 bombardment, the plates are returned to 16/8 photoperiod, 25**C. The first sampling is at 48 
hours. 

EXAMPLE 13 : Analysis of transgenic plants transfonned with pNOY 3401 

A. PGR: 

25 Samples are taken from transgenic plants in the GA-7 boxes. DNA is extracted as per 

manufacturer's directions using a Centra DNA extraction kit in a 96 well format- PGR is done 
using Jumpstart Redtaq Readymix (Sigma) and primers Thiorodoxubi 1603 (5*- 
GCGGTCGTTC ATTCGTTCTA-3' (SEQ ID NO:41)) and Thiorodox 2364 (5'- 
ACGTGCTTCA CGATGGTGTT-3' (SEQ ID NO:42)), at final concentration of 2.5^M each. 
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Transgenic plants identified by PGR to contain the thioredoxin gene are transfeired to the 
greenhouse. 



B. Analysis of thioredoxin RNA from transgenic plants by Northern Btot analysis: 

Total RNA is prepared from leaf and seed tissue of transgenic plants by the method described 
in Lagrimini et al. ((1987) Proc. Natl. Acad. Sci. 84: 7542-7546). Theprobe is prepared from 
the complete rice tj^-h gene . Plasmid pNOV 3401 is digested with BamHI- Sad to obtain a 
327 bp fragment which is gel purified and labelled with ^^P a-CTP using the random primer 
labelling kit from Life Technologies Inc. 

Northern blot analysis of leaf and seed RNA from representative transgenic plants 
shows expression of the thioredoxin mRNA in leaf and seed tissue. 

C. Analysis of thioredoxin proteins from transgenic plants: 

Protein extraction and western analysis of com leaf samples: A small circle, the size of the 
eppendorf lid, is punched from each leaf sample. The tissue is placed in an eppendorf tube and 
frozen m dry ice. A small pestle is used to grind the tissue in the eppendorf. 400 microliters of 
lOOmM Tris pH 8.0 is added to the ground tissue, the samples are rotated at room temperatu 
for thHty minutes, spun down and the extract saved. All samples ar^ concentrated using 
centricons with 3000 MW cutoff. 12.5 microliters of each sample is run on a 16% Tris-glycine 
(Novex, San Diego CA) mini gel with tris-glycine-SDS ( 24mM Tris 52mM glycine 1 % 
sodium dodecyl sulfate) running buffer, the protein is transferred to PVDF. The blot is blocked 
m TBS- 2% Tween (TBS -150mM NaCl SOmM Tris pH 10.2) for 15 minutes at room 
temperature, incubated with rice thioredoxin antibody (affinity purified from goat anti-soybean 
thaoredoxm) 1 microgram antibody per 1 milliliter raS-.05% tween at 4 degrees C overnight 
The blot is washed with TBS-.05% tween 3 times for 5 minutes, incubated with HRP (horse 
radish peroxidase) rabbit anti-goat IgG (50 nanograms antibody per millilter TBS-.05% tween 
for 1 hour at room ten^erature. washed with TBS-.05% tween. incubated with supersignal 
west femto chemiluminescent substrate (Pierce. Rockford, IL) for 5 minutes at room 
temperature. The blot is placed against film and exposed for 30 seconds. 



ire 
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Protein extraction and western analysis of com seed samples: One seed from each is cut in half, 
one half is frozen in dry ice and ground by mortar and pestle, 1 .5 nulliliters Tris pH 8.0 is 
added to each and incubated with rotation at room temperature for 30 minutes. The samples 
are spun and the protein extract from each is concentrated using centricons with a 10,000 MW 

5 cut offi 12.5 microliters of each sample is run on a 16% tris-glycine gel (same conditions as 
leaf samples). The gel is transferred to nitrocellulose, blocked with TBS-2% tween for 15 
minutes, incubated with rice thioredoxin antibody (1 nmcrogram antibody per 1 milliliter TBS- 
.05% tween) for 1.5 hours at room temperature, washed and incubated with HRP rabbit anti- 
goat IgG as described above. The blot is incubated with supersignal west pico 

10 chemiluminescent substrate (Pierce, Rockford, IL) for 5 minutes at room temperature. The blot 
is placed against film and exposed for 5 minutes. Western blot analysis shows expression of the 
rice thioredoxin protein in leaf and seed tissue. The Western blot analysis also shows that the 
rice thioredoxm expressed in transgenic plants has the expected size when compared to a 
control rice thioredoxin loaded on the same gel. 

15 

RXAMPT.R 14 : En^matic activity of recombinant thioredoxin and thioredoxin 
reductase expressed in E. coli 

Recombinant soybean thioredoxin expressed in £. coli is purified by affinity chromatography 

with S-protein agarose and the S-Tag removed by thrombin cleavage. This protein is tested in 

20 the insulin reduction assay as described. 4, 20, 40 and 80 micrograms of afiinity purified 

thioredoxin (one contaminating protein present) is tested. After 31 minutes, a change in optical 

density at 650nm is measured. 

Thioredoxin (|Xg) OD 650nm (after 3 1 n[)inutes) Rate of Precipitation (AAeso/min) 
25 0 .0000 .000000 

1 .0010 .000032 

2 .0077 .00025 

3 .0084 .00027 

4 .0117 .00038 

30 
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The following recombinant proteins are tested in the NADPH thioredoxin reductase DTNB 
assay: soybean thioredoxin with S-tag, soybean thioredoxin prep without S-tag, rice 
thioredoxin with S-tag, wheat thioredoxin with S-tag and Arabidopsis thioredoxin reductase 
without S-tag. Kcoli thioredoxin reductase (T-7915, Signia. St. Louis. MO) is also used in the 
assays. The change in optical density at 412nm is monitored over 4 min. 



10 



15 



20 



Thioredoxin 
0^1] 

30^1 (=12ng) soybean 
30^1 («12ng) soybean 
30^1 (= 6M.g) soybean 
30|ll (=15Hg) rice 
30^a (=15^g) rice 
30jil («0.6^g) wheat 
30|il (=0.6^g) wheat 
30nl (=1.2^ig) wheat 



Thioredoxi n Reductay e AA^ 

1 .5pJ (=0.5^ig) Arabidopsis 0.06 

2.3\lg Kcoli 0.333 

1 .5fJj («0.5|Lig) Arabidopsis 0.42 

2.3\lgE.coli 0.20 

1 .5fil («0.5|xg) Arabidopsis 0.66 

2.3flg Kcoli 0.30 

2.3^lg E.coli 0.30 

1 .5^1 (==0.5ng) Arabidopsis 0.08 

1 .5\l\ («0.5ilg) Arabidopsis 0.08 



Arabidopsis thioredoxin reductase and K coli thioredoxin reductase can reduce soybean 
thioredoxin with and without the S-tag. Arabidopsis and E. coli thioredoxin reductases 
also reduce rice thioredoxin. Wheat thioredoxin can be reduced by Kcoli thioredoxin 
reductase but not by Arabidopsis thioredoxin reductase. 



can 
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What is claimed is : 

1 . A method for separating the starch and protein components of grain in a milling process, 
comprising: 

(a) steeping the grain at an elevated temperature in the presence of supplemental 
5 thioredoxin reductase; and 

(b) separating the starch and protein components of the grain, wherein the thioredo?cin 
reductase is a eukaryotic thioredoxin reductase. 

2. The method of claim 1 , wherein the grain comprises grain from a transgenic plant that 
10 expresses thioredoxin reductase. 

3. The method of claim 2, wherein the plant is corn. 

4. A transgenic plant comprising heterologous DNA coding for eucaryotic thioredoxin 
15 reductase stably integrated into its nuclear or plastid genome. 

5. The plant of claim 4 wherein the plant is com or soybean. 

6. The plant of claim 4 wherein the thioredoxin reductase comprises SEQ ID NO:25 or SEQ 
20 IDNO:27. 

7. A chimeric expression cassette comprising a coding region for a eukaryotic thioredoxin 
reductase operatively linked to promoter and terminator sequences that function in a plant. 

25 8. The chimeric expression cassette of claim 7 wherein the thioredoxin reductase comprises 
SEQ ID NO:25 or SEQ ID NO:27. 

9. A method of producing grain that expresses increased levels of eukaryotic thioredoxin 
reductase comprising transforming plants with an expression cassette of claim 7. 

30 
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10. A method of producing grain that expresses increased levels of eukaryotic thioredoxin 
reductase conq)rising transforming plants with an expression cassette of claim 8. 

1 1. A method of producing grain that expresses increased levels of thioredoxin reductase, 
comprising: pollinating a first plant comprising a heterologous expression cassette 
comprising a transactivator-mediated promoter operatively linked to a DNA sequence 
coding for a eukaryotic thioredoxin reductase with pollen from a second plant comprising 
a heterologous expression cassette comprising a promoter operatively linked to a DNA 
sequence coding for a transactivator capable of regulating said transactivator-mediated 
promoter; and recovering grain from the plant thus pollinated. 



12. An isolated nucleic acid molecule comprising SEQ ID NO:24 or SEQ ID NO; 



26. 



1 3. A chimeric gene comprising a promoter active in plants operatively linked to the nucleic 
acid molecule of claim 12. 



14. A recombinant vector comprising the chimeric gene of claim 13. 

15. A transgenic host cell comprising the chimeric gene of claim 13. 

16. A transgenic host cell according to claim 15. which is a transgenic plant cell. 

17. A transgenic plant comprising the transgenic plant cell of claim 16. 

18. The transgenic plant of claim 17, which is com or soybean. 

19. Seed from a transgenic plant according to claim 17 comprising the chimeric gene of claim 
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<210> 1 
<211> 85 
<212> PRT 

<213> Methanococcus jannascihii 
<400> 1 

Met Ser Lys Val Lys lie Glu Leu Ehe Thr Ser Pro Met Cys Pro His 
1 5 10 15 

cys Pro Ala Ala Lys Arg Val Val Glu Glu Val Ala Asn Glu Met Pro 
20 25 30 

Asp Ala Val Glu Val Glu Tyr He Asn Val Met Glu Asn Pro Gin Lys 
35 40 45 

Ala Met Glu Tyr Gly He Met Ala Val Pro Ihr He Val He Asn Gly 
50 55 60 

Asp Val Glu Hie He Gly Ala Pro Hir Lys Glu Ala Leu Val Glu Ala 
65 70 75 80 

He Lys Lys Arg Leu 

85 



<210> 2 
<211> 119 

<212> PRT 

<213> Archaeogldbus fulgidus 
<400> 2 

Met Pro Met Val Arg Lys Ala Ala Phe Tyr Ala He Ala Val He Ser 
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10 



15 



Gly val Leu Ala Ala Val Val Gly Asn Ala Leu Tyr His Asn Phe Asn 
20 25 30 

Ser Asp l^u Gly Ala Gin Ala Lys He Tyr Hie Phe Tyr Ser Asp Ser 
35 40 , 45 

Cys Pro His Cys Arg Glu Val Lys Pro Tyr Val Glu Glu Phe Ala Lys 

55 60 

Ihr His Asn Leu Ihr Trp Cys Asn Val Ala Glu Met Asp Ala Asn Cys 

'^^ 75 80 

Ser Lys He Ala Gin Glu Phe Gly lie Lys Tyr Val Pro Thr Leu Val 
85 90 95 

He Met Asp Glu Glu Ala His Val Phe Val Gly Ser Asp Glu Val Arg 
100 105 

Thr Ala He Glu Gly Met Lys 
115 



<210> 3 
<211> 93 
<212> PRT 

<213> Arc±iaeogldbus fulgidus 
<400> 3 

Met val Phe Thr ser Lys Tyr cys Pro Tyr cys Arg Ala Phe Glu Lys 

5 10 15 ^ 

val val Glu Arg Leu Met Gly Glu I|u Asn Gly Thr Val Glu Phe Glu 



Val Val Asp val Asp Glu Lys Arg Glu Leu Ala Glu Lys Tyr Glu 
•^^ 40 45 



Val 



Ifet Pro Thr Leu val Leu Ala Asp Gly Asp Glu Val Leu Gly 

55 60 

Gly Phe Met Gly Phe Ala Asp Tyr Lys Thr Ala Arg Glu Ala He Leu 

^° 75 80 

Glu Gin He Ser Ala Phe Leu Lys Pro Asp Tyr Lys Asn 
85 90 

<210> 4 
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<211> 134 
<212> ERT 

<213> Ardiaeogldbus fulgidus 
<400> 4 

Met Asp Glu Leu Glu Leu lie Arg Gin Lys Lys Leu Lys Glu Dfet Ifet 
1 5 10 15 

Gin Lys Met Ser Gly Glu Glu Lys Ala Arg Lys Val Leu Asp Ser Pro 
20 25 30. 

Val Lys Leu Asn Ser Ser Asn Phe Asp Glu Hhr Leu Lys Asn Asn Glu 
35 40 45 

Asn Val Val Val Asp Phe Trp Ala Glu Trp O/s Met Pro Cys Lys Met 
50 .55 60 

lie Ala Pro Val He Glu Glu Leu Ala Lys Glu Tyr Ala Gly Lys Val 
65 70 75 80 

Val Phe Gly Lys Leu Asn Ihr Asp Glu Asn Pro Thr He Ala Ala Arg 

85 90 95 

Tyr Gly He Ser Ala He Pro ihr Leu He Phe Phe Lys Lys Gly Lys 
100 105 110 

Pro Val Asp Gin Leu Val Gly Ala Met Pro Lys Ser Glu Leu Lys Arg 
115 120 125 

Trp Val Gin Arg Asn Leu 

130 . , 



<210> 5 
<211> 105 
<212> PRT 

<213> Archaeogloibus fulgidus 
<400> 5 

Met Glu Arg Leu Asn Ser Glu Arg Hie Arg Glu Val He Gin Ser Asp 
1 5 10 15 

Lys Leu Val Val Val Asp Phe Tyr Ala Asp Trp Cys Met Pro Cys Arg 
20 25 30 

Tyr He Ser Pro He. Leu Glu Lys Leu Ser Lyis Glu Tyr . Asn Gly Glu 
.35 40 45 

Val Glu Phe Tyr Lys Leu Asn Val Asp Glu Asn Gin Asp Val Ala Phe 
50 55 60 
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Glu Tyr Gly He Ala Ser He Pro Thr Val Leu Phe Phe Arg Asn Gly 
^ 75 80 

Lys Val Val Gly Gly Phe He Gly Ala mt Pro Glu Ser Ala Val Arg 
85 90 95 

Ala Glu He Glu Lys Ala Leu Gly Ala 
100 105 



<210> 6 
<211> 301 
<212> PRT 

<213> tfethanococcus jannaschii 
<400> 6 

He His Asp Thr He He He Gly Ala Gly Pro Gly Gly Leu Thr 
5 10 15 

Ala Gly He lyr Ala Met Arg Gly Lys Leu Asn Ala Leu Cys He Glu 
2° 25 30 

Lys Glu Asn Ala Gly Gly Arg He Ala Glu Ala Gly He Val Glu Asn 

40 45 

Tyr Jto Gly Phe Glu Glu He Arg Gly Tyr Glu Leu Ala Glu Lys Phe 

55 60 

Asn His Ala Glu Lys Phe Lys Leu Pro He He Tyr Asp Glu Val 

75 80 
He Lys He Glu Ihr Lys Glu Arg Pro Phe Lys Val He Ht^ Lys Asn 



85 90 



95 



ser Glu Tyr Leu Ihr Lys Thr He Val He Ala Thr Gly ^ Lys Pro 

Lys Lys Leu Gly Leu Asn Glu Asp Lys Phe He Gly Arg Gly He Ser 
^ 120 125 

Tyr Thr Met Cys Asp Ala Phe Phe Tyr Leu Asn Lys Glu Val He 

135 

val He Gly Arg Asp Thr Pro Ala He Met Ser Ala He Asn Leu Lys 



155 



160 



Asp He Ala Lys Lys Val He Val He Ihr Asp Lys Ser Glu Leu Lys 
165 170 175 
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Ala Ala Glu Ser lie Met Leu Asp Lys Leu Lys Glu Ala Asn Asn Val 
180 185 190 

Glu lie lie Tyr Asn Ala Lys Pro Leu Glu lie Val Gly Glu Glu Arg 
195 200 205 

Ala Glu Gly Val Lys lie Ser Val Asn Gly Lys Glu Glu lie lie Lys 
210 215 220 

Ala Asp Gly lie Phe lie Ser Leu Gly His Val Pro Asn Hir Glu Phe 
225 230 235 240 

Leu Lys Asp Ser Gly He Glu Leu Asp Lys Lys Gly Phe He Lys Ihr 
245 250 255 

Asp Glu Asn Cys Arcf Thr Asn He Asp. Gly He Tyr Ala VcQ Gly Asp 
260 265 270 

Val Arg Gly Gly Val Met Gin Val Ala Lys Ala Val Gly Asp Gly Cys 
275 280 285 

Val Ala Met Ala Asn He He Lys Tyr Leu Gin Lys Leu 
290 295 300 



<210> 7 
<211> 300 
<212> PIRT 

<213> Archaeoglcjibus fulgidus 
<400> 7 

Met Tyr Asp Val Ala He He Gly Gly Gly Pro Ala Gly Leu Thr Ala 
15 10 15 

Ala Leu Tyr Ser Ala Arg Tyr Gly Leu Lys Thr Val Phe Phe Glu Thr 
20 25 . 30 

Val Asp Pro Val Ser Gin Leu Ser Leu Ala Ala Lys He Glu Asn Tyr 
35 40 45 

Pro Gly Phe Glu Gly Ser Gly Met Glu Leu Leu Glu Lys Met Lys Glu 
50 .55 60 

Gin Ala Val Lys Ala Gly Ala Glu Trp Lys Leu Glu Lys Val Glu Arg 
65 70 75 80 

Val Glu Arg Asn Gly Glu Thr. Phe Thr Val He Ala Glu Gly. Gly Glu 



95 



Tyr Glu Ala Lys 



Ala He He Val Ala Thr Gly Gly Lys His Lys Glu 
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100 105 
Ala Gly lie Glu Gly Glu Ser Ala Phe He Gly Arg Gly Val Ser lyr 



120 



125 



Cys Ma Ihr cys ASP Gly Asn Phe Phe Arg Gly Lys lys Val He Val 

135 

^ Gly ser Gly Ly. ^ Tyr Leu as Asp 

150 

lie Gly cys Glu Val ^ He Val Ser Arg Ihr Pro Ser Phe Arg Ala 

1'70 175 

Glu Lys Ala Iju Val Glu Glu Val Glu Lys Arg Gly He Pro Val His 

185 190 

^ ser Thr He ^ ^y, n ^ ^ 

200 205 

val val lyr «sn srs, olu Lys Lys Gl. Glu Phe Glu He Glu Ala 

220 

^ Gly lie fte Sla ile Gly Arg f^o Ala ar Val Val 



235 



240 



Ma Glu Gly Val Glu Arg Asp mt Gly Tyr He Lys Val Asp 

250 255 

Glu Gl. ^ ar A« val Glu ^y Val Phe Ala Ala Gly Asp Cys 

265 270 

*sp Asn Pro Lys Gin val val ar Ala CVS Gly ASP Gly Ala 

^BO 285 

Val Ala Ala Tyr Ser Ala ^ Lys Tyr Leu Ihr Ser 



<210> 8 
<211> 24 

<2i2> rm 

<213> Artificial Sequence 

.<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide (primer NMD109) 

<400> 8 

ggatccacca t^ccgccga ggag 
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<210> 9 
<211> 23 

<2i2> mk 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer NMDllO) 

<400> 9 

gagctcttag gcagaagcag atg 



<210> 10 
<211> 22 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer NMD 102) 

<400> 10 

ggatccacca tggcggcgtc gg 22 



<210> 11 
<211> 22 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer 1SIMD103) 

<400> 11 

gagctcttac tgggccgcgt gt .22 



<210> 12 
<211> 23 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer NMD124A) 



-7- 
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<400> 12 

gagctcttag gcgctagcag atg 

<210> 13 
<211> 20 

<2i2> rm 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence- 
oligonucleotide (primer NMD125A) 

<400> 13 

ggatccacca gcgctgccga 

<210> 14 
<211> 65 
<212> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence- 
oligonucleotide (primer NMD126) 

<400> 14 

gatccaccat gagggtgttg ctcgttgccc tcactctcct r^rrr^i-^^ ^ 

ccagc t^cgctctcct ggctctcgct gcgagcgcca 60 

65 

<210> 15 

<211> 61 
<212> UfJA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide (primer NMD 127) 

<400> 15 

gctggtggcg ctcgcagoga gagccaggag agcgagggca acgagcaaca ccctcatggt 60 

61 

<210> 16 
<211> 67 

<2i2> rm. 

<213> Artificial Sequence 



23 
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<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer MMD 128) 

<400> 16 

ctagcgctct gcagcagccg actccatgcc cctacgctgc tgccggcggt gtcccccact 60 
gagagct 67 

<210> 17 
<211> 59 

<2i2> rm 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
oligonucleoticae (primer NMD129) 

<400> 17 

ctcagtgggg gacaccgccg gcagcagcgt aggggcatgg agtcggctgc tgcagagcg 59 



<210> 18 
<211> 24 

<212> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer STE^IA) 

<400> 18 

ggatccacca tgaacggcct ggag 24 



<210> 19 
<211> 23 
<212> mh 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer STE^IB) 

<400> 19 

ctcgagaagt ccaccttggt cac 23 



<210> 20 
<211> .18 



9- 
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.<2i2> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer STE^F2A) 

<400> 20 

ctcgagcaag ccgttcaa 

<210> 21 
<211> 21 
<212> nsiA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer STERF2B) 

<400> 21 

gacgtcgatc ttcgggttgg a 

<210> 22 
<211> 20 
<212> mh 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer STK3A) 

<400> 22 

cgacgtcatc tggaactcct 

<210> 23 
<211> 23 

<212> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (priit^ STEQB) 

<400> 23 

gagctcagat ctagtcggac ttg 




PCT/EPOl/06918 



18 



21 



20 



- 10- 



wo 01/98509 




PCT/EPOl/06918 



<210> 24 
<211> 1021 

<2i2> rm 

<213> Arabidopsis thaliana 
<400> 24 

ggatccacca tgaacggcct ggagactcac aacacccgcc tctgcatcgt tggctccggc 60 
ccggctgccc acaccgccgc catctacgcc gcccgcgccg agctgaagcc gctcctcttc 120 
gagggctgga tggccaacga catcgccccg ggcggccagc tcaccaccac caccgacgtg 180 
gagaacttcc ccggcttccc ggagggcatc ctcggcgtgg agctgaccga caagttccgc 240 
aagcagagcg agcgcttcgg caccaccatc ttcaccgaga cogtgaccaa ggtggacttc 300 
tcgagcaagc cgttcaagct cttcaccgac tccaaggcca tcctcgccga cgccgtgatc 360 
ctcgccatcg gcgccgtggc caagtggctc tccttcgtgg gctccggcga ggtgctcggc 420 
ggcctctgga accgcggcat ctccgcctgc gctgtgtgcg acggcgccgc cccgatcttc 480 
cgcaacaagc cgctcgctgt gatcggtggc ggagacagcg cgatggagga ggccaacttc 540 
ctcaccaagt acggctccaa ggtgtacatc atcgaccgcc gcgacgcctt ccgcgcctcc 600 
aagatcatgc agcagcgcgc cctctccaac ccgaagatcg acgtcatctg gaactcctcc 660 
gtggtggagg cctacggcga cggcgagcgc gacgtgctcg gcggcctcaa ggtgaagaac 720 
gtggtgaccg gcgacgtgtc cgacctcaag gtgtccggcc tcttcttcgc catcggccac 780 
gagccggcca ccaagttcct cgacggcggc gtggagctgg actccgacgg ctacgtggtg 840 
accaagccgg gcaccaccca gacctccgtg cctggcgtgt tcgccgccgg cgacgtgcag 900 
gacaagaagt accgccaggc catcaccgcc gccggcaccg gctgcatggc cgccctcgac 960 
gccgagcact acctccagga gatcggctcc cagcagggca agtccgacta gatctgagct 1020 
c . 1021 



<210> 25 
<211> 333 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 25 

Met Asn Gly Leu Glu T!hr His Asn TSir Arg Leu Cys lie Val Gly Ser 
1 5 10 15 

Gly -Pro Ala Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu 
20 25 30 

Lys I>ro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 
35 40 45 

Gly Gin Leu Thr Thr Hir Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 
50 55 60 

Glu Gly He Leu Gly Val Glu Leu Ihr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

Glu Arg Phe Gly Thr Thr He Phe Ihr Glu Thr Val Thr Lys Val Asp 

85 90 95 
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Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala lie Leu 

105 no 

Ala Asp Ala Val He Leu Ala lie Gly Ala Val Ala Lys Trp Leu Ser 
115 120 125 

Phe Val Gly Ser Gly Glu Val Gly Gly Leu Trp Asn Arg Gly He 
130 135 

Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 

150 155 

Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 
165 170 175 

Phe Leu Ihr Lys Tyr Gly Ser Lys Val Tyr He He Asp Arg Arg Asp 
180 185 190 

Ala Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro 
1^5 200 205 

Lys He Asp Val He Ttp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 
-^J-U 215 220 

Gly Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 

230 235 240 

Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Hie Phe Ala He Gly 
245 250 255 

His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 
260 265 270 

Asp Gly Tyr val Val Thr Lys Rro Gly Thr Thr Gin Thr Ser Val Pro 
275 280 285 

Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 
290 295 300 

He Thr Ala Ala Gly Thr Gly Cys Mfet Ala Ala Leu Asp Ala Glu His 

310 315 

Tyr Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp 
325 330 



<210> 26 
<211> 1560 

<2i2> rm 

<213> Qryza sativa 
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<400> 26 

aytcagatat gttatccaga ttctaaatgt gctatagggg wtaaatgtgt gttcatatgg 60 
gagatatatc agtttcagtt tttttggaag gtgtttatag gagttraggcg cgttttaaar 120 
3<:t.gtggt:atg catcgtgttg tgarttgttk gtgtgttycy ttaaaaaaaa awttgccatt 180 
tgtcaattat tgtggaattt ctgcaacttg ttgtccamaag kaaaaggaaa atagtttcgg 240 
tcaacaactc aacatccatc tgggggtatg accgaccgag cgcggtggcc gttgattggc 300 
tcgtcgcctc ctcccttctc ggtctgacgg tctgaccacrt gccgggtagg aagcgtaatt 360 
ttgaggagag actccgaccc gcgccgccgc cgccgcagcc aagccatgga gggatccgcc 420 
ggggcgccgc tccgcacgcg cctgtgcatc atcgggagcg ggccgtcggc gcacacggcg 480 
gcgatctacg ccgcccgcgc ggagctcaag cccgtgctct tcgagggctg gctcgccaac 540 
gacatcgcgg cggggggcca gctcaccacc accaccgacg tcgagaactt cccggggttc 600 
cccgagggga tcctcggcgg cgagctcatg gatcggtgcc gcgcccagtc cctccggttc 660 
ggcaccagca tcatctccga gaccgtcacc gcggtcgact tctccgcccg ccccttccgc 720 
gtcgcctccg actccaccac cgtgctcgcc gacgccgtcg tcgtcgccac cggcgccgtc 780 
gcccggcgac tccacttcgc cggctccgac gcctactgga accgcggcat ctcagcctgc 840 
gccgtctgcg acggggccgc cccaatcttc aggaacaaac ccatcgccgt catcggcggc 900 
ggcgactccg ccatggagga gtccaacttc ctcaccaagt acggctccca tgtgtacatc 960 
atccaccgcc gcaacacctt ccgcgcctcc aagatcatgc aggccagggc gttgtcaaac 1020 
cccaagatcc aggttttctg ggactctgag gtcgtcgagg cctacggcgg cgagggtgga 1080 
ggtccattgg ctggtgtcaa ggtgaagaac ttggttactg ggaagatctc cgaccttcag 1140 
gtgtccggtc tcttcttcgc catcggacat gaaccggcga cgaagtttct cggcgggcag 1200 
cttgagctcg atgctgatgg gtatgtggcc accaagccag gctccacgca caccagtgtg 1260 
aagggggtct ttgctgctgg ggatgtgcag gacaagaagt atcgccaggc tattactgcc 1320 
gctggatcag gtttgtgaat tgatgatttt tcaggttacc tgtgattaat ttttttctgc 1380 
actttcttag agatcagtcg cttcatgggt tgctatttgc tagtgcgaat tgcaatagaa 1440 
attgttcagg gcttgagtat gtagtgagcg aatgatgatg gtcaaaatta gaaccttttt 1500 
aagctatcat agagttaacg tgtttgagtt tctgaaataa gtgctttcat tatgtatcta 1560 

<210> 27 
<211> 310 
<212> PRT 

<213> Qryza sativa 
<400> 27 

Met Glu Gly Ser Ala Gly Ala Pro Leu Arg 15ir Az^ Leu Cys He He 
1 5 .10 15 

Gly Ser Gly Pro Ser Ala His Ihr Ala Ala He Tyr Ala Ala Arg Ala 
20 25 30 

Glu Leu Lys Pro Val Leu Phe Glu Gly Trp Leu Ala Asn Asp He Ala 
35 40 45 

Ala Gly Gly Gin Leu Uir Ihr Ihr Ihr Asp Val Glu Asn Phe Pro Gly 
50 55 60 

Phe Pro Glu Gly He Leu Gly Gly Glu Leu Met Asp Arg Cys Arg Ala 
65 70 75 80 
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Gin Ser Leu Arg Phe Gly Thr Ser lie lie Ser Glu Thr Val Ifcr Ala 
85 90 95 

Val Asp Phe Ser Ala Arg Pro Phe Arg Val Ala Ser Asp Ser Ibr Thr 
100 105 110 

val Leu Ala Asp Ala Val Val Val Ala Thr Gly Ala Val Ala Arg Ara 
115 120 125 

Leu His Phe Ala Gly Ser Asp Ala Tyr Trp Asn Arg Gly He Ser Ala 
"0 135 

Ala val Cys Asp Gly Ala Ala Pro lie Phe Arg Asn Lys Pro lie 
150 155 

Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ser Asn Phe Leu 
165 170 175 

Thr Lys lyr Gly Ser His Val Tyr He He His Arg Arg Asn Ihr Phe 
180 185 190 

Arg Ala Ser Lys He Met Gin Ala Arg Ala Leu Ser Asn Pro lys He 
195 200 205 

^fi Pl^e Trp Asp ser Glu Val Val Glu Ala Tyr Gly Gly Glu Gly 
^-^u 215 220 

Gly Gly Pro Leu Ala Gly Val Lys Val Lys Asn Leu Val Ihr Gly Lys 

230 235 240 

He Ser Asp Leu Gin Val Ser Gly Leu Phe Phe Ala He Gly His Glu 
245 250 255 

Pro Ala Ihr Lys Phe Leu Gly Gly Gin Leu Glu Leu Asp Ala Asp Gly 
260 265 270 

Tyr Val Ala Ihr Lys Pro Gly Ser Thr His Ihr Ser Val Lys Gly Val 
275 280 285 

?Jn Asp val Gin Asp Lys Lys Tyr Arg Gin Ala He Ihr 

290 295 300 

Ala Ala Gly Ser Gly Leu 
305 310 



<210> 28 
<211> 32 
<212> ENA 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo P9) 



<400> 28 

gactaagctt acaattatta taitcaaaatg gc 



32 



<210> 29 
<211> 21 

<2i2> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo PlO) 

<400> 29 

gcttttccca atacgcaatg c . 21 



<210> 30 
<211> 31 
<212> ENA 

<213> Artif icieil Sequence 



<210> 31 
<211> 40 . 
<212> E8SIA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo PI) 

<400> 31 

cgtaggatcc accatggctg aagaagaggg tcaggttgtc 40 



<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo P4) 



<400> 30 

gactagcgct gacagaaact gatgctagga a 



31 



<210> 32 
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<211> 35 

<2i2> mh 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequeace: 
oligonucleotide (oligo P2) 

<400> 32 

cgtagagctc tcaagaagaa gcagcagcag cagat 

<210> 33 
<211> 29 
<212> rasiA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo P5) 



<400> 33 

gactagcgct gaagagggtc aggttgtcg 

<210> 34 
<211> 27 

<2i2> rm 

<213> Artificial Sequence 



29 



<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo P12) 

<400> 34 

cagtaggctt aaggaggttg caacgag 

<210> 35 
<211> 27 
<212> ESSIA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (oligo Pll) 

<400> 35 

cagtcagctg aagagggtca ggttgtc 
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<210> 36 
<211> 30 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; 
oligonucleotide (oligo P27) 

<400> 36 

ctaggagctc tacatggtgt ccaccagcag 30 



<210> 37 
<211> 21 
<212> OSIA 

. <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer P28) 

<400> 37 

gcacggcttg gtggtgaatc c 21 

<210> 38 
<211> 22 

<2i2> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer P29) 

<400> 38 

ctcattctgg tccatcaatg tc . 22 



<210> 39 
<211> 33 
<212> iasiA 

<213> Artif icicLl Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer P26) 

<400> 39 



17 



BNSDCXDID: <WO_0198S09A2^I_> 



wo 01/98509 ^ ^ PCT/EPOl/06918 

gactgtcgac tcaatcactc ttaccttgct gag 33 

<210> 40 
<211> 31 
<212> Usm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide (primer P31) 

<400> 40 

gactggatcc aatggtctcg aaactcacaa c 31 

<210> 41 
<211> 20 
<212> mA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide (primer thiorodoxubi 1603) 

<400> 41 

gcggtcgttc attcgttcta 20 

<210> 42 
<211> 20 
<212> EaSIA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide (primer thiorodox 2364) 

<400> 42 

acgtgcttca cgatggtgtt 20 
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